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METHODS FOR TRANSFORMING PLASTIDS 



5 

FIELD OF THE INVENTION 



This invention relates to the application of genetic engineering techniques to 
plants. More specifically, the invention relates to methods for the transformation of 
10 plant cell plastids. 



BACKGROUND 



15 The plastids of higher plants are an attractive target for genetic engineering. 

Plant plastids (chloroplasts, amyloplasts, elaioplasts, chromoplasts, etc.) are the major 
biosynthetic centers that in addition to photosynthesis are responsible for production 
of industrially important compounds such as amino acids, complex carbohydrates, 
fatty acids, and pigments. Plastids are derived from a common precursor known as a 

2 0 proplastid and thus the plastids present in a given plant species all have the same 
genetic content. Plant cells contain 500-10,000 copies of a small 120-1 60 kilobase 
circular genome, each molecule of which has a large (approximately 25kb) inverted 
repeat. Thus, it is possible to engineer plant cells to contain up to 20,000 copies of a 
particular gene of interest which potentially can result in very high levels of foreign 

2 5 gene expression. 

Current plastid transformation methods are inefficient, as such there is need 
for constructs and methods which improve plastid transformation. 



SUMMARY OF THE INVENTION 

30 

By this invention, methods which allow for the improved transformation of a 
foreign DNA into plant cell plastids are provided. Such methods generally involve 
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utilizing a plant tissue source which contains cells with an altered plastid morphology 
in the transformation methods. The alteration in the plant plastid morphology 
includes, inter alia, plastid size and number. By utilizing tissue derived from such 
plants in plastid transformation methods, efficiency of transformation of a foreign 
5 DNA into the plant cell plastid may be increased. 

As exemplified herein, constructs useful for genetic engineering of plant cells 
to provide for a method of increasing plastid transformation efficiency are provided. 
The constructs include nucleic acid sequences coding for protein sequences involved 
in controlling division of plant cell organelles. The expression of such nucleic acid 
1 0 sequences in a plant cell provides for an altered number and/or size of the chloroplasts 

within the host cell. 

DNA sequences, also referred to herein as polynucleotides, for use in 
transformation contain an expression construct comprising a promoter region which is 
functional in a plastid, and a DNA sequence encoding a gene involved in controlling 
15 the division of plant cell organelles. 

Methods for the use of transformed plants with altered plastid morphology are 
described. Such methods include plant breeding or transformation methods to provide 
plant cells having both the nuclear and plastid constructs. 

The present invention also provides methods for increasing the efficiency of 
2 0 chloroplast transformation. The method generally comprises transforming the plastids 
of a plant tissue which has been modified to have an altered number and/or size of 
plastids contained within the plant cell. 

The present invention also provides a mechanism for enhancing the efficiency 
of chloroplast transformation in plant species. 

2 5 The present invention also provides methods for improving the selectability of 

plant comprising, transforming a plant cell source having an altered plastid 
morphology with a construct comprising a promoter functional in a plant cell plastid 
operably associated with a nucleic acid sequence encoding a selectable marker. 
Selectable markers of interest in the present invention include herbicide tolerance 

3 0 genes such as glyphosate tolerance genes, and antibiotic resistance genes. Glyphosate 

tolerance genes include the CP4 gene from Agrobactenum. 
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Another aspect of the present invention provides methods for preparing a 
plant cell source with increased plastid transformation efficiency comprising, 
transforming a plant cell with a construct comprising a promoter functional in plant 
cell operably associated with a nucleic acid sequence encoding aFtsZ protein. 
5 Also considered part of this invention are the plants and plant cells obtained 

using the methods described herein. 



DESCRIPTION OF THE FIGURES 

1 0 Figure 1 provides an amino acid sequence alignment of the Arabidopsis FtsZl 

(SEQ ID NO:2), the Brassica FtsZl (SEQ ID NO:6), the tobacco FtsZl (SEQ ID 
NO:9), the Soybean FtsZl (SEQ ID NO:72) and the corn FtsZl (SEQ ID NO:73) 
protein sequences. 



15 



DETAILED DESCRIPTION OF THE INVENTION 



In accordance with the subject invention, methods are provided which allow 
for the improved transformation of a foreign DNA into plant cell plastids. Such 

2 0 methods generally involve utilizing a plant cell source which contains an altered plant 
plastid morphology. By utilizing tissue derived from such plants in plastid 
transformation methods, efficiency of transformation of a foreign DNA into the plant 
cell plastid can be increased. 

In one embodiment of the instant invention, plant tissue containing altered 

2 5 plant plastid morphology is used for plastid transformation methods. Such alterations 
in plant plastid morphology include, but are not limited to. alterations in the plastid 
size, shape and number in respect to a wild-type plastid morphology from the target 
plant cell. In general, a wild-type plastid morphology consists of small, round 
organelles contained within the plant cell, depending on the species. Furthermore, a 

30 plant cell typically contains between about 50 and about 1 00 plastids. 

The plant tissue source used in plastid transformation methods of the present 
invention contains an increase in the size of the plastids contained in the plant cells. 
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Such increases in the size of the piastids provides for a larger surface area for the 
foreign DNA to penetrate the plastid membrane during transformation. 

The iarge piastids preferably contain approximately the same number of 
plastid genomes as would be contained in corresponding number of wild-type 
5 piastids. For example, in a wild-type plant cell containing 100 piastids per cell and 
100 copies of the plastid genome in each plastid (a total of 10,000 copes of the 
plastid genome per cell), the corresponding mutant tissue source would preferably 
contain about the same number of plastid genomes, only contained in one, or several 
large plastid(s). 

10 Alternatively, a plant tissue source with an increased number of piastids, with 

a corresponding reduced size, can also find use in the plastid transformation methods 

of the present invention. 

As is understood in the art, additional methods for obtaining plants with 
alterations in the plastid size and number are known. The skilled artisan will 
15 recognize that a number of methods are available for providing for an alteration in 
plastid cell division. Such methods arc described, for example, by Strepp. et aL 
(1998) Proc. Natl. Acad. Sci. USA, 95:4368^373. 

Cell division, also referred to as cytokinesis, has been the focus of studies in 
many organisms such as bacterial, fungal, and animal cells. Division of bacterial cells 
2 0 occurs through the formation of an FtsZ ring (also referred to as a Z ring) at the site of 
division (Lutkenhaus, etal. (1997) Ann. Rev. Biochem. 66:93-1 16). The positioning 
and formation of the Z ring acts to further drive septation (cytokinesis). The ring is 
composed of a tubulin-like FtsZ protein which has GTPase activity. Mutations in the 
ftsL gene in E. coli leads to the production of a temperature-sensitive filaments with 

2 5 regularly spaced nucleoids at certain temperatures (Lutkenhaus ( 1992) In Prokaryotic 

Structure and Function: A New Perspective, ed. S Mohan, C Dow, pp 123-152. 
Cambridge: Cambridge Univ. Press). Such mutations in bacteria leads to the inability 
to divide correctly. 

The plant cell plastid as well as the mitochondria are derived from prokaryotic 

3 0 ancestors, and thus, the division apparatus of these organelles resembles that of 

bactena. Recently, identification of/ttZ related sequences in Arabidopsis and 
Physcomitrella patens have been reported (Osteryoung, et al. (1995) Nature, 376:473- 
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474; and Strcpp. et al. (1998), supra). The protein encoded by the Arabidopsis ftsZ 
gene was found to be imported into the chloroplast and was therefore speculated to be 
a component of the plastid division machinery (Osteryoung, et al. (1995), supra). 
More recently, the involvement of FtsZ in plastid division was directly demonstrated. 
5 The disruption of the ftsZ gene in a lower plant, Physcomitrella patens, impeded 
plastid division, thereby giving rise to mutant cell lines with one or a few large 
plastids (Strepp, et al. (1998), supra). 

The use of plants with an altered number and/or size of plastids containing one 
or few large plastids could therefore be used as targets for plastid transformation of 

1 0 any plant species. Such plants containing an altered size and/or number can be 

obtained using various methods, including mutagenesis, antisense suppression, or co- 
suppression. Methods for the mutagenesis of plant genomes are well known in the art, 
and include chemical, such as ethylmethane sulfonate (EMS) and nitrosoguanidine 
(NTG), as well as physical mutagenesis methods such as fast neutron bombardment. 

15 Other means for obtaining a plant source with an alteration in the size and/or 

number of plastids contained in the cell are also contemplated. For example, tissue 
for use in the transformation methods of the present invention can be obtained from 
plants grown in culture conditions which provide for such altered plastid content. For 
example, tissue obtained from plants grown in vitro under culture conditions in which 

20 inhibitors of bacterial cell division, such as 5,5*-bis-(8-aniUno-l-naphtalenesulfonate) 
(Yu, etal. (1998) J. Biol Chem. 273:10216-10222), are present, can be utilized as a 
cell source for the plastid transformation methods of the present invention. 

In a preferred embodiment, such plants containing cells with an alteration in 
the size and/or number of plastids are generated by anti-sense expression of the FtsZ 

2 5 gene. Once plastid transformation is achieved and homoplasmic plants are identified, 

the anti-sense transgene can be eliminated by out-crossing and the wild-type condition 
of 50 to 100 plastids per cell restored. Similarly, plants regenerated from plastid 
transformed tissue containing an altered number and/or size of plastids from 
mutations can also be reverted to the wild-type plastid conditions using such out- 

3 0 crossing methods. 
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In the case of the use of culture conditions for obtaining plant cells with an 
altered number and/or size of plastids, wild-type plastids can be obtained by releasing 
the tissue from such culture conditions. 

In another embodiment of the present invention, novel nucleic acid sequences 
5 are provided which encode proteins related to proteins involved in bacterial cell and 
plastid division. 

In particular, novel nucleic acid sequences from Arabidopsis, soybean, corn, 
Brassica are provided which encode FtsZ related proteins. Such nucleic acid 
sequences find use in the preparation of DNA constructs. Such constructs find use in 

10 the production of plants with an altered number and/or size of chloroplasts. 

The skilled artisan will recognize that other DNA sequences useful for the 
production of plants with an altered number and/or size of chloroplasts are available 
in the art. The sequences include but are not limited lo,ftsA,ftsL,fisl.ftsQ,ftsii, 
ftsW,ftsK (Lutkenhaus, et al. (1997) supra), and the arc genes (Pyke, et al. (1992) 

15 Plant Physiol. 99:1005-1008; Pyke et al. (1994) Plant Physiol. 104:201-207; and 
Pyke (1997) Am. /. Botany 84:1017-1027). 

In order to obtain additional ftsZ sequences, a genomic or other appropriate 
library prepared from the candidate plant source of interest can be probed with 
conserved sequences from one or more plant and/or bacterial/ttZ sequence(s) to 

2 0 identify homologously related sequences. Positive clones can be analyzed by 

restriction enzyme digestion and/or sequencing. When a genomic library is used, one 
or more sequences can be identified providing both the coding region, as well as the 
transcriptional regulatory elements of the/ttZ gene from such plant source. Probes 
can also be considerably shorter than the entire sequence. Oligonucleotides can be 

2 5 used, for example, but should be at least about 10, preferably at least about 1 5, more 
preferably at least 20 nucleotides in length. When shorter length regions are used for 
comparison, a higher degree of sequence identity is required than for longer 
sequences. Shorter probes are often particularly useful for polymerase chain reactions 
(PCR), especially when highly conserved sequences can be identified. (See, Gould, et 

30 al., PNAS USA (1989) 86:1934-1938.) 

When longer nucleic acid fragments (>100 bp) are employed as probes, 
especially when using complete or large cDNA sequences, one can still screen with 
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moderately high stringencies (for example using 50% formamide at 37°C with 
minimal washing) in order to obtain signal from the target sample with 20-50% 
deviation, i.e., homologous sequences. (For additional information regarding 
screening techniques see Beltz, el ai, Meth. Enzymology ( 1983) 700:266-285). 
5 Another aspect of the present invention relates to isolated FtsZ 

polynucleotides. The polynucleotide sequences of the present invention include 
isolated polynucleotides that encode the polypeptides of the invention having a 
deduced amino acid sequence selected from the group of sequences set forth in the 
Sequence Listing and to other polynucleotide sequences closely related to such 

10 sequences and variants thereof. 

The invention provides a polynucleotide sequence identical over its entire 
length to each coding sequence as set forth in the Sequence Listing. The invention 
also provides the coding sequence for the mature polypeptide or a fragment thereof, as 
well as the coding sequence for the mature polypeptide or a fragment thereof in a 

1 5 reading frame with other coding sequences, such as those encoding a leader or 

secretory sequence, a pre-, pro-, orprepro- protein sequence. The polynucleotide can 
also include non-coding sequences, including for example, but not limited to, non- 
coding 5* and 3' sequences, such as the transcribed, untranslated sequences, 
termination signals, ribosome binding sites, sequences that stabilize mRNA, introns, 

2 0 polyadenylation signals, and additional coding sequence that encodes additional 
amino acids. For example, a marker sequence can be included to facilitate the 
purification of the fused polypeptide. Polynucleotides of the present invention also 
include polynucleotides comprising a structural gene and the naturally associated 
sequences that control gene expression. 

2 5 The invention also includes polynucleotides of the formula: 

X-(R,) n -(R2)-(R3)n-Y 

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, Ri 
and R 3 are any nucleic acid residue, n is an integer between 1 and 3000, preferably 
between 1 and 1000 and R 2 is a nucleic acid sequence of the invention, particularly a 

3 0 nucleic acid sequence selected from the group set forth in the Sequence Listing and 

preferably SEQ ID NOs:l,3,5,7,8,and 10-31. In the formula, R2 is oriented so that its 
5' end residue is at the left, bound to R,. and its 3' end residue is at the right, bound to 
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R 3 . Any stretch of nucleic acid residues denoted by cither R group, where R is greater 
than 1, may be either a heteropolymer or a homopolymer, preferably a heteropolymer. 

The invention also relates to variants of the polynucleotides described herein 
that encode for variants of the polypeptides of the invention. Variants that are 
5 fragments of the polynucleotides of the invention can be used to synthesize full-length 
polynucleotides of the invention. Preferred embodiments are polynucleotides 
encoding polypeptide variants wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid 
residues of a polypeptide sequence of the invention are substituted, added or deleted, 
in any combination. Particularly preferred are substitutions, additions, and deletions 

1 0 that are silent such that they do not alter the properties or activities of the 
polynucleotide or polypeptide. 

Further preferred embodiments of the invention that are at least 50%, 60%, or 
70% identical over their entire length to a polynucleotide encoding a polypeptide of 
the invention, and polynucleotides that are complementary to such polynucleotides. 

1 5 More preferable are polynucleotides that comprise a region that is at least 80% 
identical over its entire length to a polynucleotide encoding a polypeptide of the 
invention and polynucleotides that are complementary thereto. In this regard, 
polynucleotides at least 90% identical over their entire length are particularly 
preferred, those at least 95% identical are especially preferred. Further, those with at 

2 0 least 97% identity are highly preferred and those with at least 98% and 99% identity 
are particularly highly preferred, with those at least 99% being the most highly 
preferred. 

Preferred embodiments are polynucleotides that encode polypeptides that 
retain substantially the same biological function or activity as the mature polypeptides 

2 5 encoded by the polynucleotides set forth in the Sequence Listing. 

The invention further relates to polynucleotides that hybridize to the above- 
described sequences. In particular, the invention relates to polynucleotides that 
hybridize under stringent conditions to the above -described polynucleotides. As used 
herein, the terms "stringent conditions" and "stringent hybridization conditions" mean 

3 0 that hybridization will generally occur if there is at least 95% and preferably at least 

97% identity between the sequences. An example of stringent hybridization 
conditions is overnight incubation at 42°C in a solution comprising 50% formamide, 

8 
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5x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 
7.6). 5x Denhardfs solution, 10% dextran sulfate, and 20 micrograms/milliliter 
denatured, sheared salmon sperm DNA, followed by washing the hybridization 
support in 0.1 x SSC at approximately 65°C. Other hybridization and wash conditions 
5 are well known and are exemplified in Sambrook, et a/.. Molecular Cloning: A 
Laboratory Manual, Second Edition, cold Spring Harbor, NY (1989), particularly 
Chapter 1 1 . 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 

1 0 complete gene for a polynucleotide sequence set for in the Sequence Listing under 
stringent hybridization conditions with a probe having the sequence of said 
polynucleotide sequence or a fragment thereof; and isolating said polynucleotide 
sequence. Fragments useful for obtaining such a polynucleotide include, for example, 
probes and primers as described herein. 

15 As discussed herein regarding polynucleotide assays of the invention, for 

example, polynucleotides of the invention can be used as a hybridization probe for 
RNA, cDNA, or genomic DNA to isolate full length cDN As or genomic clones 
encoding a polypeptide and to isolate cDNA or genomic clones of other genes that 
have a high sequence similarity to a polynucleotide set forth in the Sequence Listing. 

20 Such probes will generally comprise at least 15 bases. Preferably such probes will 
have at least 30 bases and can have at least 50 bases. Particularly preferred probes 
will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a 
polynucleotide sequence set forth in the Sequence Listing may be isolated by 

2 5 screening using a DNA sequence provided in the Sequence Listing to synthesize an 

oligonucleotide probe. A labeled oligonucleotide having a sequence complementary 
to that of a gene of the invention is then used to screen a library of cDNA, genomic 
DNA or mRNA to identify members of the library which hybridize to the probe. For 
example, synthetic oligonucleotides are prepared which correspond to theFtsZ EST 

3 0 sequences . The oligonucleotides are used as primers in polymerase chain reaction 

(PCR) techniques to obtain 5' and 3' terminal sequence of FtsZ genes. Alternatively, 
where oligonucleotides of low degeneracy can be prepared from particular FtsZ 
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peptides, such probes may be used directly to screen gene libraries forFtsZ gene 
sequences. In particular, screening of cDNA libraries in phage vectors is useful in 
such methods due to lower levels of background hybridization. 

Typically, aFtsZ sequence obtainable from the use of nucleic acid probes will 
5 show 60-70% sequence identity between the target FtsZ sequence and the encoding 
sequence used as a probe. However, lengthy sequences with as little as 50-60% 
sequence identity may also be obtained. The nucleic acid probes may be a lengthy 
fragment of the nucleic acid sequence, or may also be a shorter, oligonucleotide probe. 
When longer nucleic acid fragments are employed as probes (greater than about 100 
1 0 bp), one may screen at lower stringencies in order to obtain sequences from the target 
sample which have 20-50% deviation (i.e., 50-80% sequence homology) from the 
sequences used as probe. Oligonucleotide probes can be considerably shorter than the 
entire nucleic acid sequence encoding an FtsZ enzyme, but should be at least about 10, 
preferably at least about 1 5, and more preferably at least about 20 nucleotides. A 
15 higher degree of sequence identity is desired when shorter regions are used as opposed 
to longer regions. It may thus be desirable to identify regions of highly conserved 
amino acid sequence to design oligonucleotide probes for detecting and recovering 
other related FtsZ genes. Shorter probes are often particularly useful for polymerase 
chain reactions (PCR), especially when highly conserved sequences can be identified. 
20 (See, Gould, et ai., PNAS USA (1989) 56:1934-1938.). 

Another aspect of the present invention relates to FtsZ polypeptides. Such 
polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as 
polypeptides and fragments thereof, particularly those polypeptides which exhibit 
FtsZ activity and also those polypeptides which have at least 50%, 60% or 70% 

2 5 identity, preferably at least 80% identity, more preferably at least 90% identity, and 

most preferably at least 95% identity to a polypeptide sequence selected from the 
group of sequences set forth in the Sequence Listing, and also include portions of such 
polypeptides, wherein such portion of the polypeptide preferably includes at least 30 
amino acids and more preferably includes at least 50 amino acids. 

3 0 "Identity", as is well understood in the art, is a relationship between two or 

more polypeptide sequences or two or more polynucleotide sequences, as determined 
by comparing the sequences. In the art. "identity" also means the degree of sequence 

10 
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relatedness between polypeptide or polynucleotide sequences, as determined by the 
match between strings of such sequences. "Identity" can be readily calculated by 
known methods including, but not limited to, those described in Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York (1988); 
5 Biocomputing: Informatics and Genome Projects, Smith, D.W., ed.. Academic Press, 
New York, 1993; Computer Analysis of Sequence Data, Parti, Griffin, A.M. and 
Griffin, H.G., eds., Humana Press, New Jersey (1994); Sequence Analysis in 
Molecular Biology, von Heinje, G., Academic Press (1987); Sequence Analysis 
Primer, Gribskov, M. and Devereux, J., eds., Stockton Press, New York (1991); and 

10 Carillo, H., and Lipman, D., SIAM J Applied Math, 48:1073 (1988). Methods to 
determine identity are designed to give the largest match between the sequences 
tested. Moreover, methods to determine identity are codified in publicly available 
programs. Computer programs which can be used to determine identity between two 
sequences include, but are not limited to, GCG (Devereux, J., et al.. Nucleic Acids 

15 Research 12(1):387 (1984); suite of five BLAST programs, three designed for 

nucleotide sequences queries (BLASTN, BLASTX. and TBLASTX) and two designed 
for protein sequence queries (BLASTP and TBLASTN) (Coulson, Trends in 
Biotechnology, 12: 76-80 (1994); Birren, etal. Genome Analysis, 1: 543-559 (1997)). 
The BLAST X program is publicly available from NCBI and other sources (BLAST 

2 0 Manual, Altschul, S., et al., NCBI NLM NIH, Bethesda, MD 20894; Altschul, S., et 
al., J. Mol. Biol., 215:403-410 (1990)). The well known Smith Waterman algorithm 
can also be used to determine identity. 

Parameters for polypeptide sequence comparison typically include the 

following: 

2 5 Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 

Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. 
Acad. Sci USA 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

3 0 A program which can be used with these parameters is publicly available as 

the "gap" program from Genetics Computer Group, Madison Wisconsin. The above 



11 
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parameters along with no penalty for end gap are the default parameters for peptide 
comparisons. 

Parameters for polynucleotide sequence comparison include the following: 
Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970) 
5 Comparison matrix: matches = +10; mismatches = 0 

Gap Penalty: 50 
Gap Length Penalty: 3 

A program which can be used with these parameters is publicly available as 
the "gap" program from Genetics Computer Group, Madison Wisconsin. The above 
1 0 parameters are the default parameters for nucleic acid, comparisons. 

The invention also includes polypeptides of the formula: 
X-(R,)„-(R 2 )-(R3)n-Y 
wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is 
hydrogen or a metal, R, and R 3 are any amino acid residue, n is an integer between 1 
15 and 1000, and R 2 is an amino acid sequence of the invention, particularly an amino 

acid sequence selected from the group set forth in the Sequence Listing and preferably 
SEQ ID NOs: 2,4,6, and 9. In the formula, R 2 is oriented so that its amino terminal 
residue is at the left, bound to R,, and its carboxy terminal residue is at the right, 
bound to R 3 . Any stretch of amino acid residues denoted by cither R group, where R 
20 is greater than 1 , may be either a heteropolymer or a homopolymer, preferably a 
heteropolyrner. 

Polypeptides of the present invention include isolated polypeptides encoded by 
a polynucleotide comprising a sequence selected from the group of a sequence 
contained in the Sequence Listing set forth herein . 

2 5 The polypeptides of the present invention can be mature protein or can be part 

of a fusion protein. 

Fragments and variants of the polypeptides are also considered to be a part of 
the invention. A fragment is a variant polypeptide which has an amino acid sequence 
that is entirely the same as part but not all of the amino acid sequence of the 

3 0 previously described polypeptides. The fragments can be "free-standing" or 

comprised within a larger polypeptide of which the fragment forms a pan or a region, 
most preferably as a single continuous region. Preferred fragments are biologically 

12 



SUBSTITUTE SHEET (RULE 26) 



WO 00/32799 PCT/US99/28103 

active fragments which are those fragments that mediate activities of the polypeptides 
of the invention, including those with similar activity or improved activity or with a 
decreased activity. Also included are those fragments that antigenic orimmunogenic 
in an animal, particularly a human. 
5 Variants of the polypeptide also include polypepudes that vary from the 

sequences set forth in the Sequence Listing by conservative amino actd substitutions, 
substitution of a residue by another with like characteristics. In general, such 
substitutions are among Ala, Val, Leu and He; between Ser and Thr; between Asp and 
Glu; between Asn and Gin; between Lys and Arg; or between Phe andTyr. 
10 Particularly preferred are variants in which 5 to 10; 1 to 5; 1 to 3 or one amino acid(s) 
are substituted, deleted, or added, in any combination. 

Variants that are fragments of the polypeptides of the invention can be used to 
produce the corresponding full length polypeptide by peptide synthesis. Therefore, 
these variants can be used as intermediates for producing the full-length polypeptides 

15 of the invention. 

The polynucleotides and polypeptides of the invention can be used, for 
example, in the transformation of host cells, such as plant host cells, as further 
discussed herein. 

The invention also provides polynucleotides that encode a polypeptide that is a 
2 0 mature protein plus additional amino or carboxyl-terminal amino acids, or amino 
acids within the mature polypeptide (for example, when the mature form of the 
protein has more than one polypeptide chain). Such sequences can, for example, play 
a role in the processing of a protein from a precursor to a mature form, allow protein 
transport, shorten or lengthen protein half-life, or facilitate manipulation of the protein 
25 in assays or production. It is contemplated that cellular enzymes can be used to 
remove any additional amino acids from the mature protein. 

A precursor protein, having the mature form of the polypeptide fused to one or 
more prosequences may be an inactive form of the polypeptide. The inactive 
precursors generally are activated when the prosequences are removed. Some or all of 
30 the prosequences may be removed prior to activation. Such precursor protein are 
generally called proproteins. 
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Homologous sequences are found when there is an identity of sequence and 
can he determined upon comparison of sequence information, nucleic acid or amino 
acid, or through hybridization reactions between a known FtsZ and a candidate source. 
Conservative changes, such as Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys andGln/Asn can 
5 also be considered in determining sequence homology. Typically, a lengthy nucleic 
acid sequence can show as little as 50-60% sequence identity, and more preferably at 
least about 70% sequence identity, between the target sequence and the given FtsZ 
sequence of interest excluding any deletions which can be present, and still be 
considered related. Amino acid sequences are considered homologous by as little as 
10 25% sequence identity between the two complete mature proteins. {See generally, 
Doolittle, R.F., OF URFS and ORFS (University Science Books, CA, 1986.) 

In addition, not only can sequences provided herein be used to identify 
homologous FtsZ sequences, but the resulting sequences obtained therefrom can also 
provide a further method to obtain FtsZ sequences from other plant and/or bacterial 
1 5 sources. In particular, PCR can be a useful technique to obtain related FtsZ sequences 
from sequence data provided herein. One skilled in the art will be able to design 
oligonucleotide probes based upon sequence comparisons or regions of typically 
highly conserved sequence. 

Once the nucleic acid sequence is obtained, the transcription, or transcription 
20 and translation (expression), of the FtsZ sequence in a host cell is desired to produce a 
ready source of the enzyme and/or modify the number and/or size of the plastids 
found therein. Other useful applications can be found when the host cell is a plant 
host cell, in vitro and in vivo. 

Nucleic acids (genomic DNA, plasmid DNA. cDNA, synthetic DNA, mRNA, 

2 5 etc.) encoding FtsZ or amino acid sequences of the purified enzymes, which permit 

design of nucleic acid probes facilitating the isolation of DNA coding sequences 
therefor, are known in the art and are available for use in the methods of the present 
invention. It is generally recognized to an artisan skilled in the field to which the 
present invention pertains that the nucleic acid sequences provided herein and the 

3 0 amino acid sequences derived therefrom can be used to isolate other potential FtsZ 

genes from GenBank using DNA and peptide search techniques generally known in 
the an. 

14 



SUBSTITUTE SHEET (RULE 26) 



WO 00/32799 PCT/US99/28103 

In addition to the sequences described in the present invention, DNA coding 
sequences useful in the present invention can be derived from algae, fungi, bacteria, 
plants, etc. Homology searches in existing databases using signature sequences 
corresponding to conserved nucleotide and amino acid sequences of FlsZ can be 
5 employed to isolate equivalent, related genes from other sources such as plants and 
microorganisms. Searches in EST databases can also be employed. Furthermore, the 
use of DNA sequences encoding enzymes functionally enzymatically equivalent to 
those disclosed herein, wherein such DNA sequences are degenerate equivalents of 
the nucleic acid sequences disclosed herein in accordance with the degeneracy of the 

10 genetic code, is also encompassed by the present invention. Demonstration of the 
functionality of coding sequences identified by any of these methods can be carried 
out by complementation of mutants of appropriate organisms, such as E. coli. The 
sequences of the DNA coding regions can be optimized by gene resynthesis, based on 
codon usage, for maximum expression in particular hosts. 

15 The nucleic acid sequences which encode FtsZ can be used in various 

constructs, for example, as probes to obtain further sequences. Alternatively, these 
sequences can be used in conjunction with appropriate regulatory sequences to 
increase levels of the respective FtsZ sequence of interest in a host cell for recovery or 
study of the enzyme in vitro or in vivo or to decrease levels of the respective FtsZ 

2 0 sequence of interest for some applications when the host cell is a plant entity, 
including plant cells, plant parts (including but not limited to seeds, cuttings or 
tissues) and plants. 

Thus, depending upon the intended use, the constructs can contain the nucleic 
acid sequence which encodes the entire FtsZ protein, or a portion thereof. For 

2 5 example, where antisense inhibition of a given FtsZ protein is desired, the entire FtsZ 

sequence is not required. Furthermore, where FtsZ constructs are intended for use as 
probes, it can be advantageous to prepare constructs containing only a particular 
portion of a FtsZ encoding sequence, for example a sequence which is discovered to 
encode a highly conserved FtsZ region. 

3 0 As discussed above, nucleic acid sequence encoding a plant or other FtsZ 

proteins of this invention can include genomic, cDNA or mRNA sequence. By 

• encoding" is meant that the sequence corresponds to a particular amino acid sequence 
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either in a sense or anti-sense orientation. By "extrachromosomaT is meant that the 
sequence is outs.de of the plant genome of which it is naturally associated. By 
"recombinant" is meant that the sequence contains a genetically engineered 
modification through manipulation via mutagenesis, restriction enzymes, and the like. 
5 A cDNA sequence may or may not contain pre-processing sequences, such as 

transit peptide sequences or targeting sequences to facilitate delivery of the FtsZ 
protein to a given organelle or membrane location. The use of any such precursor 
FtsZ DNA sequence is preferred for uses in plant cell expression. A genomic FtsZ 
sequence can contain the transcription and translation initiation regions, introns, 

1 0 and/or transcript termination regions of the plant FtsZ, which sequences can be used 
in a variety of DNA constructs, with or without the FtsZ structural gene. Thus, 
nucleic acid sequences corresponding to the FtsZ sequences of this invention can also 
provide signal sequences useful to direct protein delivery into a particular organellar 
or membrane location, 5* upstream non-coding regulatory regions (promoters) having 

1 5 useful tissue and timing profiles. 3' downstream non-coding regulatory regions useful 
as transcriptional and translation^ regulatory regions, and may lend insight into other 

features of the gene. 

Once the desired plant or other FtsZ nucleic acid sequence is obtained, it can 
be manipulated in a variety of ways. Where the sequence involves non-coding 
2 0 flanking regions, the flanking regions can be subjected to resection, mutagenesis, etc. 
Thus, transitions, transversions, deletions, and insertions can be performed on the 
naturally occurring sequence. In addition, all or part of the sequence can be 
synthesized. In the structural gene, one or more codons can be modified to provide 
for a modified amino acid sequence, or one or more codon mutations can be 

2 5 introduced to provide for a convenient restriction site or other purpose involved with 

construction or expression. The structural gene can be further modified by employing 
synthetic adapters, linkers to introduce one or more convenient restriction sites, or the 
like. 

For the most part, the constructs will involve regulatory regions functional in 

3 0 plants which provide for altered size and number of plastids in a plant cell. The open 

reading frame, coding for the FtsZ protein. FtsZ-related protein or functional fragment 
thereof will be joined at its 5' end to a transcription initiation regulatory region such as 
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the wild-type sequence naturally found 5' upstream to theFtsZ or FisZ-related 
structural gene, or to a heterologous regulatory region from a gene naturally expressed 
in plant tissues. Examples of useful plant regulatory gene regions include those from 
T-DNA genes, such as nopaline oroctopine synthase, plant virus genes, such as 
5 CaMV 35S, or from native plant genes. 

The DNA sequence encoding a plant or other FtsZ protein of this invention 
can be employed in conjunction with all or pan of the gene sequences normally 
associated with the FtsZ. In its component parts, a DNA sequence encoding FtsZ is 
combined in a DNA construct having, in the 5' to 3' direction of transcription, a 

1 0 transcription initiation control region capable of promoting transcription and 

translation in a host cell, the DNA sequence encoding plant FtsZ and a transcription 
and translation termination region. 

Potential host cells include both prokaryotic and eukaryotic cells. A host cell 
can be unicellular or found in a multiccllar differentiated or undifferentiated organism 

1 5 depending upon the intended use. Cells of this invention can be distinguished by 
having a FtsZ sequence foreign to the wild-type cell present therein, for example, by 
having a recombinant nucleic acid construct encoding a FtsZ protein therein not native 
to the host species. 

Depending upon the host, the regulatory regions will vary, including regions 
2 0 from viral, plasmid or chromosomal genes, or the like. For expression in prokaryoric 
or eukaryotic microorganisms, particularly unicellular hosts, a wide variety of 
constitutive or regulatable promoters can be employed. Expression in a 
microorganism can provide a ready source of the plant enzyme. Among 
transcriptional initiation regions which have been described are regions from bacterial 
25 and yeast hosts, such as E. coli, B. subtilis, Sacchromyces cerevisiae, including genes 
such as beta-galactosidase, T7 polymerase, tryptophan E and the like. 

In a preferred embodiment, the constructs will involve regulatory regions 
functional in plants which provide for modified production of plant FtsZ, and. 
possibly, modification of the plant cell plastid. The open reading frame coding for the 
3 0 plant FtsZ or functional fragment thereof will be joined at its 5' end to a transcription 
initiation regulatory region. In embodiments wherein the expression of the FtsZ 
protein is desired in a plant host, the use of all or part of the complete plant FtsZ gene 
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is desired; namely all or part of the 5' upstream non-coding regions (promoter) 
together with the structural gene sequence and 3' downstream non-coding regionscan 
be employed. 

If a different promoter is desired, such as a promoter native to the plant host of 
5 interest or a modified promoter, i.e., having transcription initiation regions derived 
from one gene source and translation initiation regions derived from a different gene 
source, numerous transcription initiation regions are available which provide for a 
wide variety of constitutive or regulable, e.g., inducible, transcription of the 
structural gene functions. The transcription/translation initiation regions 

1 0 corresponding to such structural genes are found immediately 5' upstream to the 

respective start codons. Among transcriptional initiation regions used for plants are 
such regions associated with the T-DNA structural genes such as fornopaline and 
mannopine synthases, the 19S and 35S promoters from CaMV, and the 5' upstream 
regions from other plant genes such as napin, ACP, SSU, PG, zein, phaseolin E. and 

1 5 the like. Enhanced promoters, such as double 35S, are also available for expression of 
FtsZ sequences. For such applications when 5' upstream non-coding regions are 
obtained from other genes regulated during seed maturation, those preferentially 
expressed in plant embryo tissue, such as ACP and napin-derived transcription 
initiation control regions, are desired. Such "seed-specific promoters" can be obtained 

20 and used in accordance with the teachings of issued U.S. Patent Numbers 5,608.152 
and 5,530,194, which references are hereby incorporated by reference. Transcription 
initiation regions which are preferentially expressed in seed tissue, i.e.. which are 
undetectable in other plant parts, are considered desirable for TAG modifications in 
order to minimize any disruptive or adverse effects of the gene product. 

2 5 Regulatory transcript termination regions can be provided in DN A constructs 

of this invention as well. Transcript termination regions can be provided by the DNA 
sequence encoding the plant FtsZ or a convenient transcription termination region 
derived from a different gene source, for example, the transcript termination region 
which is naturally associated with the transcript initiation region. Where the transcript 

3 0 termination region is from a different gene source, it will contain at least about 0.25 

kb, preferably about 1-3 kb of sequence 3' to the structural gene from which the 
termination region is derived. 
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Plant expression or transcription constructs having a plant FtsZ as the DN A 
sequence of interest for increased or decreased expression thereof can be employed 
with a wide variety of plant life, particularly, plant life involved in the production of 
vegetable oils for edible and industrial uses. Most especially preferred are temperate 
5 oilseed crops. Plants of interest include, but are not limited to, rapeseed (Canola and 
High Erucic Acid varieties), sunflower, safflower, cotton, soybean, peanut, coconut 
and oil palms, and corn. Depending on the method for introducing the recombinant 
constructs into the host cell, other DNA sequences can be required. Importantly, this 
invention is applicable to dicotyledenous and monocotyledenous species alike and 
1 0 will be readily applicable to new and/or improved transformation and regulation 
techniques. 

The method of transformation is not critical to the instant invention; various 
methods of plant transformation are currently available. As newer methods are 
available to transform crops, they can be directly applied hereunder. For example, 

15 many plant species naturally susceptible to Agrobacterium infection can be 

successfully transformed via tripartite or binary vector methods of Agrobacterium- 
mediated transformation. In addition, techniques of microinjection, DNA particle 
bombardment, and electroporation have been developed which allow for the 
transformation of various rnonocot and dicot plant species. 

2 0 In developing the DNA construct, the various components of the construct or 

fragments thereof will normally be inserted into a convenient cloning vector which is 
capable of replication in a bacterial host, e.g., E. coli. Numerous vectors exist that 
have been described in the literature. After each cloning, the plasmid can be isolated 
and subjected to further manipulation, such as restriction, insertion of new fragments, 

2 5 ligation, deletion, insertion, resection, etc., so as to tailor the components of the 

desired sequence. Once the construct has been completed, it can then be transferred to 
an appropriate vector for further manipulation in accordance with the manner of 
transformation of the host cell. 

Normally, included with the DNA construct will be a structural gene having 

3 0 the necessary regulatory regions for expression in a host and providing for selection of 

transformant cells. The gene can provide for resistance to a cytotoxic agent, e.g. 
antibiotic, heavy metal, toxin, etc.. complementation providing prototrophy to an 
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auxotrophic host, viral immunity or the like. Depending upon the number of different 
host species in which the expression construct or components thereof are introduced, 
one or more markers can be employed, where different conditions for selection are 
used for the different hosts. A number of markers have been developed for use for 
5 selection of transformed plant cells, such as those which provide resistance to various 
antibiotics, herbicides, or the like. The particular marker employed is not essential to 
this invention, one or another marker being preferred depending on the particular host 
and the manner of construction. 

As mentioned above, the manner in which the DNA construct is introduced 

1 0 into the plant host is not critical to this invention. Any method which provides for 
efficient transformation can be employed. Various methods for plant cell 
transformation include the use of Ti- or Ri-plasmids, microinjection, electroporation, 
DNA particle bombardment, liposome fusion, or the like. In many instances, it will 
be desirable to have the construct bordered on one or both sides by T-DNA, 

15 particularly having the left and right borders, more particularly the right border. This 
is particularly useful when the construct uses A. tumcfaciens or A rhizogenes as a 
mode for transformation, although the T-DNA borders can find use with other modes 
of transformation. 

Once a transgenic plant is obtained which contains cells with altered numbers 
2 0 and/or sizes of chloroplasts, tissue containing such cells can then be used in plastid 
transformation experiments. For example, utilizing ussue containing cells with larger 
plasdds provides for a larger target in plastid transformation methods, thus allowing 
for an increased probability of introduction of the foreign DNA into the plant cell 
plastid. 

2 5 The DNA sequences, or polynucleotides, for use in plastid transformation of 

this invention will contain a plastid expression construct generally comprising a 
promoter functional in a plant cell plastid, and a DNA sequence of interest to be 
expressed in the transformed plastid cells. 

Constructs and methods for use in transforming the plastids of higher plants 

30 are described in Zoubenko etal. (Nuc Acid Res (1994) 22(19):38 19-3824), Svab et ai 
(Proc. Natl. Acad. Sc.'.(1990) 87:8526-8530 and Proc. Natl. Acad. Sc/.(1993) 90:913- 
917) and Staub et al. (EMBO J. (1993) 12:601-606). Constructs and methods for use 
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in transforming plastids of higher plants to express DNA sequences under the control 
of a nuclearly encoded, plastid targeted T7 polymerase are described in U.S. Patent 
Number 5,576,198. The complete DNA sequences of the plastid genome of tobacco 
are reported by Shinozaki et al. (EMBO J. (1986) 5:2043-2049). 
5 Stable transformation of tobacco plastid genomes by particle bombardment is 

reported (Svab et.al. (1990), supra) and Svab et al. (1993), supra). The methods 
described therein can be employed to obtain plants homoplasmic for plastid 
expression constructs using the methods described herein. Briefly, such methods 
involve DNA bombardment of a target host explant, preferably from a tissue which is 

1 0 rich in metabolically active plastid organelles, such as green plant tissues including 
leaves, and cotyledons. The bombarded tissue is then cultured for ~2 days on a cell 
division promoting media. The plant tissue is then transferred to a selective media 
containing an inhibitory amount of the particular selective agent, as well as the 
particular hormones and other substances necessary to obtain regeneration for that 

1 5 particular plant species. For example, in the above publications and the examples 
provided herein, the selective marker is the bacterial aadk gene and the selective 
agent is spectinomycin. The aadA gene product allows for continued growth and 
greening of cells whose chloroplasts comprise the marker gene product. Cells which 
do not contain the marker gene product are bleached. The bombarded explants will 

2 0 form green shoots in approximately 3-8 weeks. Leaves from these shoots are then 
subcultured on the same selective media to ensure production and selection of 
homoplasmic shoots. As an alternative to a second round of shoot formation, the 
initial selected shoots can be grown to mature plants and segregation relied upon to 
provide transformed plants homoplastic for the inserted gene construct. 

2 5 The transformed plants so selected can then be analyzed to determine whether 

the entire plastid content of the plant has been transformed (homoplastic 
transformants). Typically, following two rounds of shoot formation and 
spectinomycin selection, approximately 50% of the transgenic plantlets analyzed are 
homoplastic as determined by Southern blot analysis of plastid DNA. These plantlets 

3 0 are selected for further cultivation, both for analysis of the transgenic plastid 

phenotype (where the nuclear viral polymerase expression construct is also present in 
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the plastid transformant). or for use in methods to transform the viral polymerase 
construct into the nucleus of the transplastomic plants. 

The methods of the present invention provide for a more efficient approach to 
obtaining homoplasm.c plants. Wild-type plant cells typically contain 50 to 100 
5 plastids per cell. However, once a transplastomic plant is obtained, the DNA 
sequence contained in the plant cell nucleus can be crossed away from the 
transplastomic cells. The DNA sequence transformed into the nucleus encoding for 
the alteration can be crossed away from the plant containing the transformed plastids. 
Once the DNA sequence has been crossed out. the plastids in the host plant cell can 
10 divide and revert back to normal (i.e. wild-type) plasud size and numbers. By 
applying the selective agent for which the plastid expression constructs provides 
resistance, cells containing a pure population of the plastids containing the foreign 

DNA can be obtained. 

The vectors for use in plastid transformation preferably include means for 
15 providing a stable transfer of the plastid expression construct and selectable marker 
construct into the plastid genome. This is most conveniendy provided by regions of 
homology to the target plastid genome. The regions of homology flank the construct 
to be transferred and provide for transfer to the plastid genome by homologous 
recombination, via a double crossover into the genome. The complete DNA sequence 

2 0 of the plastid genome of tobacco has been reported (Shinozaki et aL EMBO J. ( 1986) 

5:2043-2049). Complete DNA sequences of the plastid genomes from liverwort 
(Ohyama et aL. Nature ( 1986) J22:572-574) and rice (Hiratsuka et aL. Mol. Gen. 
Genet. (1989) 277:185-194), have also been reported. 

Where the regions of homology are present in the inverted repeat regions of 
25 the plastid genome (known as IRA and IRB), two copies of the transgene are expected 
per transformed plastid. Where the regions of homology are present outside the 
inverted repeat regions of the plastid genome, one copy of the transgene is expected 
per transformed plasud. The regions of homology within the plastid genome are 
approximately lkb in size. Smaller regions of homology can also be used, and as little 

3 0 as 100 bp can provide for homologous recombination into the plastid genome. 

However, the frequency of recombination and thus the frequency of obtaining plants 
having transformed plastids decreases with decreasing size of the homology reg.ons. 
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Examples of constructs comprising such regions of homology for tobacco plastid 
transformation are described in Svab et.al. (1990 supra) and Svab and Maliga (1993 
supra). Regions useful for recombination into tobacco and Brassica plastid genomes 
are also described in the following examples. Similar homologous recombination and 
5 selection constructs can be prepared using plastid DNA from the target plant species. 

Other means of transfer to the plastid genome are also considered herein, such 
as by methods involving the use of transposable elements. For example, the 
constructs to be transferred into the plastid genome can be flanked by the inverted 
repeat regions from a transposable marker which functions in plant plastids. A DNA 

1 0 construct which provides for transient expression of the transposase required to 

transfer the target DNA into the plastids is also introduced into the chloroplasts. In 
this manner, a variety of phenotypes can be obtained in plants transformed with the 
same expression construct depending on positional effects which can result from 
insertion of the expression constructs into various locations on the plastid genome. 

1 5 Appropriate transposons for use in such plastic transformation methods include 
bacterial TnlO, bacteriophage Mu and various other known bacterial transposons. 

The DNA sequence of interest in the plastid promoter expression constructs 
can be an encoding sequence which is oriented for expression of a particular structural 
gene, such that the protein encoded by the structural gene sequence is produced in the 

2 0 transformed plastid. In addition, the DNA sequence of interest can include a number 
of individual structural gene encoding regions such that an operon for expression of a 
number of genes from a single plastid promoter region is produced. Thus, it is 
possible to introduce and express multiple genes from an engineered or synthetic 
operon or from a pre-existing prokaryouc gene cluster. Such a method would allow 

2 5 large scale and inexpensive production of valuable proteins and fine chemicals in a 

particular desired plant tissue or a particular stage of development, depending upon 
the promoter used to drive nuclear expression of the specific viral polymerase. Such 
an approach is not practical by standard nuclear transformation methods since each 
gene must be engineered into a monocistron including an encoded transit peptide for 

3 0 plastid uptake and appropriate promoter and terminator signals. As a result, gene 

expression levels would be expected to vaiy widely between cistrons. and generation 
of a number of transgenic plant lines would be required. Ultimately crosses would be 
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required to introduce all of these cistrons into one plant to get express.on to the target 

biochemical pathway. 

Alternatively, the DNA sequence of interest in the plastid construct can be a 
fragment of an endogenous plastid gene oriented such that an RNA complementary to 
5 the endogenous gene mRNA is produced in the transformed plastid. Such antisense 
constructs can be used to decrease the expression of the target plastid gene. 

In order to provide a means of selecting the desired plant cells following 
plastid transformation, the polynucleotides for plastid transformation will also contain 
a construct which provides for expression of a marker gene. Expression of the marker 

1 0 gene product allows for selection of plant cells comprising plastid organelles which 
are expressing the marker protein. In the examples provided herein, a bacterial aadh 
gene is expressed under the regulatory control of chloroplast 5' promoter and 3' 
transcription termination regions. The use of such an expression construct for plastid 
transformation of plant cells has been described by S vab and Maliga (1993, supra). 

1 5 Expression of the aadh gene confers resistance to spectinomycin and streptomycin, 
and thus allows for the identification of plant cells expressing this marker gene. 
Selection for the aadk marker gene is based on identification of plant cells which are 
not bleached by the presence of streptomycin, or more preferably spectinomycin. in 
the plant growth medium. Other genes which encode a product involved in 

2 0 chloroplast metabolism can also be used as selectable markers. For example, genes 

which provide resistance to plant herbicides such as glyphosate, bromoxynil or 
imidazoline can find particular use. Such genes have been reported by Stalker et al. 
V. Biol. Chem. (1985) 260:4724-4728; glyphosate resistant EPSP), Stalker etal. (J. 
Biol. Chem. (1985) 265:6310-6314; bromoxynil resistant nitrilase gene), and 
25 Sathasivan et al. (Nucl. Acid, Res. (1990) 75:2188; AHAS .midazolinone resistance 
gene). 

The present invention also provides methods for obtaining a plastid 
transformed plant on medium containing glyphosate. At the initial event of 
transformation only a few plastids out of the many present in a plant cell are 

3 0 transformed and therefore are able to express glyphosate resistant marker gene 

product. The rest of the untransformed plastids within the cell remains vulnerable to 
the effect of glyphosate. Therefore, although the cell contains transformed plastids, it 
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is unable to divide and sort out the transformed plastid resulting in lack of recovery of 
transformed callus tissue which would give rise to the transformed regenerants. Thus, 
any method that reduces plastid number to one or few within the cell has the potential 
to survive the effect of glyphosate and be useful as selectable marker for plastid 
transformation. 

The following examples are provided by way of illustration and not by way of 
limitation. 



10 EXAMPLES 
Example 1: Identification of Plant/wZ Sequences 

In order to obtain a plant tissue source with an altered number and/or size of 
1 5 plastids using antisense and/or sense expression of the bacterial FtsZ plant 
homologues, public as well as proprietary sequence databases are queried for 
homologous sequences in soybean, rice, Arabidsopsis, com and Brassica. Two types 
of plant FtsZ proteins have been previously identified in GenBank, type IFtsZ 
proteins exemplified by accession gi!1079731 (SEQ ID NO:32), appear to be imported 

2 0 into the plastid, while type II FtsZ proteins, exemplified by accession gil3608494 

(SEQ ID NO:33) and gil683524 (SEQ ID NO:34), appear to remain in the cytoplasm. 
Homologs of both the type I FtsZ sequence as well as homologues of type IIFtsZ 
genes are described below. The sequences used to search against the databases are: 
type I FtsZ homologue search was (SEQ ID NO:32), and for type IIFtsZ searches, 
25 (SEQ ID NO:33) is used. 

Searches performed in proprietary databases containing sequences obtained 
from Arabidopsis identified DNA sequences which are related to the FtsZl sequence. 
The sequence of SEQ ID NO: 1 is identified as AtFtsZl . The deduced amino acid 
sequence encoded by SEQ ID NO: 1 is provided in SEQ ID NO:2. In addition, one 

3 0 sequence (SEQ ID NO:3) was identified as related to the FtsZ2 sequence. The 

deduced amino acid sequence encoded by SEQ ID NO:3 is provided in SEQ ID NO:4. 
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Sequences were also identified in databases containing sequences obtained 
from Brassica. One sequence was identified as related to the Arabidopsis FtsZl 
sequence. Based on sequence alignments between the two sequences, approximately 
170 amino acids were predicted to be missing from the Brassica sequence at the N- 
5 terminus. To obtain a full length coding sequence for the Brassica FtsZl (BnFtsZl) 
gene, RACE PCR using DNA obtained from Brassica leaves was performed using the 
primers SC258 (SEQ ID NO:35) and SC259 (SEQ ID NO:36). One reaction product 
was found to contain the most 5' sequence (SEQ ID NO:70) and was used to produce 
a full length sequence referred to as BnFtsZl (SEQ ID NO:5). The deduced amino 

10 acid sequence encoded by BnFtsZl is provided in SEQ ID NO:6) 

A FtsZl homolog was also identified in tobacco with PCR using primers 
designed to the conserved amino acid domains of the Arabidopsis FtsZl sequence. 
The PCR primers used are identified as SC252 (SEQ ID NO:37), SC253 (SEQ ID 
NO:38), SC254 (SEQ ID NO:39) and SC255 (SEQ ID NO:40). The reaction products 

1 5 were cloned into TOPO TA (Invitrogen), and a single clone, referred to as xanthil-26- 
contig (SEQ ID NO:7), contained the most sequence. Additional primers were 
designed for use in RACE PCR to obtain full length coding sequence for the tobacco 
FtsZl homolog. For amplification of the 5' region, primers SC291 (SEQ ID NO:41) 
and SC292 (SEQ ID NO:42) were used, and for amplification of the 3' sequence, 

2 0 primers SC293 (SEQ ID NO:43) and SC294 (SEQ ID NO:44) were used. The PCR 
products were cloned in TOPO TA and sequenced. Clone xanftsZl-5'-15 (SEQ ID 
NO:71) was chosen to be the best for the 5' tobacco FtsZl sequence since it contained 
the greatest amount of 5' sequence and overlap with xanthil-26-contig. This sequence 
was combined with the xanthil-26-contig to produce xanFtsZl (SEQ ID NO:8). The 

2 5 deduced amino acid sequence is provided in SEQ ID NO:9. 

FtsZ homolog sequences were identified in databases containing DNA 
sequences obtained from corn by BLAST searches using the Arabidopsis FtsZl and 
FtsZ2 amino acid sequences. Ten sequence were identified as related to these FtsZ 
sequences, provided in SEQ ID NOs: 10-19. The clones, when aligned, revealed six 

3 0 contigs, and the best representative clone for each were chosen for further analysis. 

Sequence analysis of SEQ ID NO: 10 revealed a high homology to AtFtsZl, and was 
estimated to be missing 158 amino acids at the N-terminal end when compared to 
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Arabidopsis FtsZ 1 . Clone SEQ ID NO: 1 3 was found to overlap perfectly with SEQ 
ID NO: 10 for 153 nt at the 5'end and in addition had 167 nt additional nt at the 5' end 
that had amino acid homology with the Arabidopsis FtsZl. However, this clone was 
also not predicted to encode the full-length FtsZ, and was still missing 1 1 3 amino 
5 acids at the N-terminal end when compared to Arabidopsis FtsZl . Interestingly, for 
clone SEQ ID NO: 13, its homology with SEQ ID NO: 10, ends at position 167nt and 
diverges. This could either be indicative of the presence of intronic sequence or a new 
class of FtsZ protein. Primer SC321 (SEQ ID NO:45) was designed to pull out the 
missing maize FtsZl sequence by RACE PCR. 

10 Sequence analysis of SEQ ID NO: 18 revealed its high homology to FtsZ2, and 

was also predicted to not to be full-length and missing about 286 amino acids at the 
N-terminal end when compared to Arabidopsis FtsZ2. Primer SC322 (SEQ ID 
NO:46) was designed to pull out the missing maize FtsZ2 sequence by RACE PCR. 
Although SEQ ID NO;14 and SEQ ID NO: 15 were identified with the highest BLAST 

15 scores with FtsZ2. 

Soybean FtsZ homolog sequences were identified in databases by BLAST 
searches with Arabidopsis FtsZl and FtsZ2 amino acid sequences. Twelve sequences 
were obtained, and are provided in SEQ ID NOs:20-31. Sequence analysis of SEQ ID 
NO:20, SEQ ID NO:24 and SEQ ID NO:25 revealed high homology to FtsZl and 

2 0 none to be full-length when compared to Arabidopsis FtsZ 1 . SEQ ID NO:25 had the 
longest sequence at the N-terminal end and is predicted to be missing 64 amino acids 
at the N-terminal when compared to Arabidopsis FtsZl sequence. Sequences of SEQ 
ID NO:20, SEQ ID NO:24 and SEQ ID NO:25 were used to correct the overlapping 
region. RACE PCR primers can now be designed to amplify the ends for obtaining a 

2 5 full length DNA sequence. 

A sequence alignment between the Arabidopsis. Brassica, tobacco, soybean, 
and com FtsZl protein sequences is provided in figure 1. 
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Example 2: Preparation of Plant Expression Constructs 
2A. Nuclear Expression Constructs 
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Constructs are prepared for transformation into a plant cell nucleus for 
alteration of the plasticl size and/or number in the transformed plant cell. Constructs 
can be prepared to alter the plastids constitutively, or in a tissue specific manner, for 
example, in leaf tissue, or seed tissue. 
5 A plasmid containing the napin cassette derived from pCGN3223 (described 

in USPN 5,639,790, the entirety of which is incorporated herein by reference) was 
modified to make it more useful for cloning large DNA fragments containing multiple 
restriction sites, and to allow the cloning of multiple napin fusion genes into plant 
binary transformation vectors. An adapter comprised of the self annealed 

1 0 oligonucleotide of sequence 

CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCAT 

TTAAAT (SEQ ID NO:47) was ligated into the cloning vector pBC SK+ (Stratagene) 
after digestion with the restriction endonuclease BssHTI to construct vector 
P CGN7765. Plamids pCGN3223 and pCGN7765 were digested with NotI and 

15 ligated together. The resultant vector, pCGN7770. contains the pCGN7765 backbone 
with the napin seed specific expression cassette from pCGN3223. 

The cloning cassette, pCGN7787, essentially the same regulatory elements as 
pCGN7770, with the exception of the napin regulatory regions of pCGN7770 have 
been replaced with the double CAMV 35S promoter and the tml polyadenylation and 

20 transcriptional termination region. 

A binary vector for plant transformation, pCGN5139, was constructed from 
pCGN1558 (McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). 
The polylinker of pCGNl 558 was replaced as a HindIII/Asp718 fragment with a 
polylinker containing unique restriction endonuclease sites, AscI, Pad, Xbal, Swal, 

2 5 BamHI,and NotI. The Asp7 1 8 and HindHI restriction endonuclease sites are retained 

inpCGN5139. 

A series of turbo binary vectors are constructed to allow for the rapid cloning 
of DNA sequences into binary vectors containing transcriptional initiation regions 
(promoters) and transcriptional termination regions. 

3 0 The plasmid pCGN86 1 8 was constructed by ligating oligonucleotides 5'- 

TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:48) and 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ lDNO:49) into 
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SalUXhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker 
and napin 3' region was excised from pCGN8618 by digestion with Asp718I; the 
fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment then 
ligated into P CGK5 139 that had been digested with Asp7 1 81 and Hindlll and blunt- 
5 ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the 
insert oriented so that the napin promoter was closest to the blunted Asp7 181 site of 
P CGN5139 and the napin 3' was closest to the blunted Hindlll site was subjected to 
sequence analysis to confirm both the insert orientation and the integrity of cloning 
junctions. The resulting plasmid was designated pCGN8622. 

10 The plasmid pCGN8619 was constructed by ligating oligonucleotides 5'- 

TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC -3' (SEQ ID NO:50) and 5'- 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:5 1) into 
Sall/XhoI-digested pCGN7770. A fragment containing the napin promoter, polylinker 
and napin 3' region was removed from pCGN8619 by digestion with Asp7181; the 

1 5 fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment then 
ligated into P CGN5139 that had been digested with Asp718I and Hindlll and blunt- 
ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the 
insert oriented so that the napin promoter was closest to the blunted Asp718I site of 
pCGN5139 and the napin 3' was closest to the blunted Hindlll site was subjected to 

2 0 sequence analysis to confirm both the insert orientation and the integrity of cloning 

junctions. The resulting plasmid was designated pCGN8623. 

The plasmid pCGN8620 was constructed by ligating oligonucleotides 5'- 
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT -3' (SEQ ID NO:52) 
and 5'-CCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:53) into 
2 5 Sall/SacI-digested pCGN7787. A fragment containing the d35S promoter, polylinker 
and tml 3' region was removed from pCGN8620 by complete digestion with Asp718I 
and partial digestion with Notl. The fragment was blunt-ended by filling in the 5' 
overhangs with Klenow fragment then ligated into P CGN5 139 that had been digested 
with Asp718I and Hindlll and blunt-ended by filling in the 5' overhangs with Klenow 

3 0 fragment. A plasmid containing the insert oriented so that the d35S promoter was 

closest to the blunted Asp718I site of pCGN5139 and the tml 3' was closest to the 
blunted Hindlll site was subjected to sequence analysis to confirm both the insert 
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orientation and the integrity of cloning junctions. The resulting plasmid was 

designated pCGN8624. 

The plasmid P CGN8621 was constructed by ligating oligonucleotides 5'- 
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT -3' (SEQ ID NO:54) 
5 and 5'-GGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ TD NO:55) into 

SaiySacI-digested pCGN7787. A fragment containing the d35S promoter, polylinker 
and tml 3' region was removed from pCGN8621 by complete digestion with Asp718I 
and partial digestion with Notl. The fragment was blunt-ended by filling in the 5' 
overhangs with Klenow fragment then ligated into pCGN5139 that had been digested 
10 with Asp718I and HindlH and blunt-ended by filling in the 5' overhangs with Klenow 
fragment. A plasmid containing the insert oriented so that the d35S promoter was 
closest to the blunted Asp71 81 site of P CGN5 139 and the tml 3' was closest to the 
blunted Hindlll site was subjected to sequence analysis to confirm both the insert 
orientation and the integrity of cloning junctions. The resulting plasmid was 
15 designated pCGN8625. 

The plasmid construct pCGN8640 is a modification of pCGN8624 described 
above. A 938bp Pstl fragment isolated from transposon Tn7 which encodes bacterial 
spectinomycin and streptomycin resistance (Fling et al. {WS), Nucleic Acids 
Research 13(19):7095-7106), a determinant for E. coli and Agrobacterium selection, 
20 was blunt ended with Pfu polymerase. The blunt ended fragment was ligated into 
pCGN8624 that had been digested with Spel and blunt ended with Pfu polymerase. 
The region containing the Pstl fragment was sequenced to confirm both the insert 
orientation and the integrity of cloning junctions. 

The specunomycin resistance marker was introduced into pCGN8622 and 
2 5 pCGN8623 as follows. A 7.7 Kbp Avrll-SnaBI fragment from pCGN8640 was 
ligated to a 10.9 Kbp Avrll-SnaBI fragment from pCGN8623 or pCGN8622, 
described above. The resulting plasmids were pCGN8641 and pCGN8643, 
respectively. 

The Arabidopsis FtsZl nucleotide sequence was used to construct the sense 
3 0 expression vector P CGN6495 for use in transformation of Arabidopsis, Brassica and 
tobacco. For this construct, the Arabidopsis ftsZl sequence was PCR amplified. To 
monitor protein expression of FtsZl in transformed lines, a c-myc tag (EQKLISEEDL 

30 



SUBSTITUTE SHEET (RULE 26) 



WO 00/32799 PCT/US 99/28 103 

(SEQ ID NO:56)), was translationally fused to FisZl at the C-terminal end. The PCR 
amplification was done by first round of amplification with primers SC247 (SEQ ID 
NO:57) and SC260 (SEQ ID NO:58) followed by amplification with SC247 (SEQ ID 
NO:59) and SC261 (SEQ ID NO:60) using the product of the first amplification as the 
5 template DNA, using standard amplification parameters. The final amplification 
product, FtsZl/c-myc fusion was cloned in the nuclear transformation vector 
pCGN8624 to create pCGN6495, which was used to nuclear transform Arabidopsis, 
canola and tobacco using standard protocols. 

The turbo vector pCGN8624 was used for the antisense constructs such that 

1 0 the antisense sequence is driven from d35S promoter. For Arabidopsis the coding 
sequence (from ATG to TAG) was amplified with primers SC248 (SEQ ID NO:61) 
and SC250 (SEQ ID NO:62) using AtFtsZl as template. For Brassica, primers 
SC276 (SEQ ID NO:63) and SC268 (SEQ ID NO:64) were used with PCR fragment 
SC3-1-1 (SEQ ID NO:70) as template DNA to generate zHindOUPstl fragment and 

15 cloned in pBSKS (Stratagene) to generate pCGN6528. Primer SC276 was designed to 
be located 140 bases downstream from ATG due to the presence of nonhomologous 
stretch of sequence compared to Arabidopsis FtsZl contained in the first 140 bases 
sequence fragment. The 3' half of the coding sequence was PCR amplified using 
primers SC269 (SEQ ID NO:65) and SC270 (SEQ ID NO:66) to produce aPstLNotl 

2 0 fragment, and subsequently cloned in pCGN6528 to generate pCGN6529. The 

HindllUPstl fragment containing BnFlsZl sequence (from 140b downstream of ATG 
to TAG) was cloned in turbo vector pCGN8624 to generate final transformation 
vectors pCGN6530 and pCGN661 1. The HindHUNotl fragment containing BnFtsZl 
sequence was also cloned into pCGN8643 vector for seed-specific antisense FtsZl 

2 5 expression. For tobacco, primers SC305 and SC306 were designed to PCR amplify 
FtsZl sequence to produce a SseUNotl fragment using 5' RACE PCR library DNA 
made from leaf RNA, and cloned into TOPO TA2.1 to produce pCGN6565. The 
SseUNotl fragment from pCGN6565 was cloned in the turbo vector pCGN8624 to 
generate final transformation vector pCGN6566. 

30 

2A. Plastid Expression Constructs 
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Constructs and methods for use in transforming the plastids of higher plants 
are described in Zoubenko et al. (NucAad Res (1994) 22(1 9):38 19-3824), Svab et al. 
(Proc. Natl. Acad. Sci.(l990) 87:8526-8530 and Proc. Natl. Acad. 5d.(1993) 90:913- 
917) and Staub et al. {EMBO J. (1993) 12:601-606). Constucts and methods for use 
5 in transforming plastids of higher plants to express DNA sequences under the control 
of a nuclearly encoded, plastid targeted T7 polymerase are described in U.S. Patent 
Number 5,576,198. The complete DNA sequences of the plastid genome of tobacco 
are reported by Shinozaki et al. (EMBO J. (1986) 5:2043-2049). 

A plastid expression construct. P MON49218, was constructed to express the 
1 0 synthetic CP4 sequence with the 1 4 amino acid GFP fusion from the promoter region 
of the 16SrDNA operon having the nuclear-encoded RNA polymerase region 
(PrrnPEP+NEP), and the terminator region from the plastid rpsl6 gene. The DNA 
sequence of the Prm/NEP/G10L::14aaGFP fusion SEQ ID NO:67. 



15 

Example 3: Plant Transformation And Analysis 

Constructs for the expression of sense or antisense sequences are transformed 
into tobacco cells using the methods described by Ursin et. al. (1991) Plant Cell 
20 3:583-591. 

Transgenic tobacco plants containing the nuclear FtsZ constructs were 
analyzed for alterations in plastid morphology, including size and number of plastids 
present in the plant cell. 

Fifty-eight initial transformants (Tl generation) obtained from transformation 

2 5 with FtsZ 1 expression construct pCGN6495 were screened for the large plastid 

phenotype and divided into three categories. Thirty-four (34) lines contained less than 
5 large plastids, 8 lines contained between 5-20 plastids and 16 lines more than 20 
(wild-type# and more than wild-type#) plastids. One line, Nt6495-61, contained a 
single large plastid. 

3 0 The screening method involved examining isolated mesophyll protoplasts at 

100X magnification under light microscope. The large plastid containing transgenic 
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plants appear to be phenotypically indistinguishable from wild-type under culture and 

greenhouse conditions. 

Estimation of plastid DNA copy number from several large plastid lines 
revealed no difference when compared to wild-type. Southern analysis was used to 
estimate transgene copy number in the large plastid lines and several lines with single 
integration events were identified. Western analysis of the large plastid lines with c- 
myc antibody confirmed expression of the introduced transgene (tagged by c-myc). T2 
seeds were collected from selected plants from each of the three categories. 



10 



Example 4: Plastid Transformation and Analysis 



Leaf material from three transgenic lines, Nt6495-30 (with <5plastids/cell), 
N16495-16 (with 5-20 plastids/cell) and Nt6495-69 (with 5-20 plastids/cell), were 

1 5 obtained for evaluation of plastid transformation efficiency and direct glyphosate 
selection. Plastid transformation vector pMON49218 which contains aadA gene for 
spectinomycin selection and GFP as a marker was used to bombard 15 leaf explants of 
each of the three transgenic lines. For each series of bombardment of the transgenic 
line 15 wild type control leaves were used. The order of bombardment for the 

2 0 transgenic line and the wild type leaves were randomized to eliminate any bias. 

Transformation frequency of one event Nt6495-30 was approximately double 
that of the wild type control producing 7 versus 3 transformants respectively. Nt6495- 
16 and Nt6495-69 had approximately the same transformation frequency (3 
transformants) as the control. Thus, our preliminary analysis reveals that plastid 

2 5 transformation efficiency can have been enhanced by reducing the plastid number 

from wild type to less than 5 plastids per cell. Interestingly, all of the plastid 
transforms regenerants from Nt6495 lines were very much slower in growth and 
size compared to those from wild type. It appears that the presence of the selectable 
antibiotic spectinomycin dihydrochloride at a concentration of 500mg/ml can have 

3 0 affected the regenerability of cells in the Nt6495 lines. Thus, it is possible that there 

could be more plastid transformed cells in the transgenic Nt6495 lines which were 
susceptible to the antibiotic and could not regenerate. To check if this was the case, 
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kill curves with lower concentrations of spectinomycin ^hydrochloride (50. 100, 200, 
300, 400 and 500mg/ml) can be used with each of the Nt6495-30. Nt6495-16 and 
N16495-69 lines to establish the concentration at which the regeneration of shoots are 
as good as in wild type. This concentration of spectinomycin dihydrochlonde will 
, then be used to repeat transformation frequency tests with the three Nt6495 lines. 

To analyze for direct glyphosate selection, kill curves with varying levels of 
glyphosate will be established with the Nt6495 lines to find the best selection level. 
Plastid transformation vector pMON49218 will be used to bombard the Nt6495 lines 
and tested for direct selection using the optimized glyphosate level. 



10 



In Arabidopsis. FtsZl nuclear expression construct P CGN6495 was used to 
transform Columbia ecotype. Tl seeds were collected and about 100 kanamycin 
resistant seedlings were analyzed for alteration in plastid size and number following 
the same protocols as outlined for the tobacco section of this report. The transgenic 
1 5 plants were divided into three groups based on plastid number-I) 20 independent 
lines containing few large plastid (1-5),(U) 23 lines lines containing 5-20 plastids and 
(III) 50 lines containing wild-type plastid number were obtained. Selected T2 plants 
from each category were analyzed for number of transgene integration loci and sent to 
the growth chamber forT3 seed collection to identify homozygous plants. Such 
2 0 plants can be used in plastid transformations as described by Sikdar, et al. (1998) 
Plant Cell Reports, 18:20-24. 

Transformed plants selected for aadh marker gene expression or glyphosate 
resistance are analyzed to determine whether the entire plastid content of the plant has 
been transformed (homoplasmic transformants). Typically, following two rounds of 
2 5 shoot formation and spectinomycin selection, approximately 50% of the transgenic 
plantlets which are analyzed are homoplasmic, as determined by Southern blot 
analysis of plastid DNA. Homoplasmic plantlets are selected for further cultivation. 

Southern blot analysis is used to confirm the integration of the chimeric 
expression cassettes in the plastid genome. Preparation, electrophoresis, and transfer 
of DNA to filters is as described (Svab et al., (1993 supra)). Total plant cellular DNA 
can be prepared as described by Dellaporta et al. (1983) Plant Mol. Biol. Rep. 7:19- 
21). 
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To visually observe the expression of marker genes such as GFP from the 
chloroplasts of transformed plants, various tissues are visualized utilizing a dissecting 
microscope. Protoplasts and chloroplasts are isolated as described in Sidorov, et al. 
(1994) Thtor.Appl. Genet. 88:525-529. 
5 The above results demonstrate that the sequences of the present invention 

provide an efficient means for the production of plastid transformed plants. 
Furthermore, such methods find use in plastid transformation methods involving the 
selection of transplastomic plants on herbicides, for example glyphosate. 

1 0 All publications and patent applications mentioned in this specification are 

indicative of the level of skill of those skilled in the art to which this invention 
pertains. All publications and patent applications are herein incorporated by reference 
to the same extent as if each individual publication or patent application was 
specifically and individually indicated to be incorporated by reference. 

1 5 Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced within the scope of the 
appended claim. 
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CLAIMS 

What is claimed is: 

5 1 . In a method for transforming a plant cell plastid comprising the steps of 

introducing into cells of a plant a construct comprising a promoter functional in a 
plant ceU plastid operably associated with a DNA sequence of interest and 
transforming said plant cell plastid with said construct, 

wherein the improvement comprises introducing said construct into a plant cell 
10 having an altered plant plastid morphology selected from the group consisting of 
altered plastid size and altered plastid number in said plant cell. 

2. The method according to Claim 1 , wherein said plastid size is increased 
from a wild-type plant plastid morphology. 

3. The method according to Claim 2, further wherein said plastid number is 
1 5 decreased from a wild-type plant plastid morphology. 

4. The method according to Claim 1, wherein said plastid size is decreased 
from a wild-type plant plastid morphology. 

5. The method according to Claim 4, further wherein said plastid number is 
increased from a wild-type plant plastid morphology. 

2 0 6. The method according to Claim 2, wherein said plant cell is obtained from 

a plant tissue source in which plastid division is inhibited. 

7. The method according to Claim 6, wherein said plastid division is 
inhibited by introduction into cells of the plant tissue source a second DNA construct 
comprising in the 5' to 3' direction of transcription a promoter functional in a plant 

2 5 cell, a DNA sequence coding for a gene involved in plastid cell division and a 

transcriptional termination sequence functional in a plant cell. 

8. The method according to Claim 7, wherein said DNA sequence is in an 

antisense orientation. 

9. The method according to Claim 8, wherein said construct contains a DNA 

3 0 sequence coding for an FtsZ protein. 

10. The method according to Claim 7, wherein said DNA sequence is in a 

sense orientation. 
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1 1. The method according to Claim 10. wherein said DNA sequence prov.des 

for sense suppression. 

12. The method according to Claim 6, wherein said plastid division is 
inhibited by growing a plant under culture conditions which inhibit the division of 

5 plant cell plastids. 

13. The method according to Claim 12, wherein said culture conditions 
comprise growing the plant tissue source under exposure to an inhibitor of bacterial 
cell division. 

14. The method according to Claim 13, wherein said inhibitor is 5,5'-Bis-(8- 
1 0 anilino- 1 -naphthalenesulfonate). 

15. The method according to Claim 6, wherein said plastid division is 

inhibited by genetic mutagenesis. 

16. An isolated DNA sequence encoding a plant FtsZ protein from 

Arabidopsis thaliana. 

15 17. The DNA sequence of Claim 16, wherein said FtsZ protein is encoded by 

a sequence which includes a sequence selected from the group consisting of SEQ ID 
Nos: 1 and 3. 

18. An isolated DNA sequence encoding a plant FtsZ protein from Brassica . 

19. The DNA sequence of Claim 18, wherein said FtsZ protein is encoded by 

20 a sequence of SEQ ID NO:5. 

20. An isolated DNA sequence encoding a plant FtsZ protein from soybean. 

21 . The DNA sequence of Claim 20, wherein said FtsZ protein is encoded by 
a sequence which includes a seqeunce selected from the group consisting of SEQ ID 
NOs:20-31. 

25 22. An isolated DNA sequence encoding a plant FtsZ protein from corn. 

23. The DNA encoding sequence of Claim 22, wherein said FtsZ protein is 
encoded by a sequence which includes a sequence selected from the group consisting 

of SEQ ID Nos: 10-19. 

24. A recombinant DNA construct comprising any of the DNA encoding 

30 sequences of Claims 16-23. 

25. A plant cell comprising the DNA construct of 24. 

26. A plant comprising a cell of Claim 25. 
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27. A method for improving the selectability of plant comprising. 

transforming a plant cell source having an altered plastid morphology 
with a construct comprising a promoter functional in a plant cell plastid operably 
associated with a nucleic acid sequence encoding a selectable marker. 
5 28. The method according to Claim 27, wherein said nucleic acid sequence 

encodes an herbicide tolerance gene. 

29. The method according to Claim 27, wherein said nucleic acid sequence 
encodes a glyphosate tolerance gene. 

30. A method for preparing a plant cell source with increased plastid 
1 0 transformation efficiency comprising, 

transforming a plant cell with a construct comprising a promoter functional in 
plant cell operably associated with a nucleic acid sequence encoding aFtsZ protein. 

31. a method for transforming a plant cell plastid comprising, 

introducing into a plant cell having altered plastid morphology a first 
1 5 nucleic acid construct comprising a promoter functional in a plant cell plastid 
operably associated with a nucleic acid sequence of interest. 
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SEQUENCE LISTING 



<110> Calgene LLC 

<12 0> Methods for Transforming Plastids 



<130> 15595/00/WO 

<150> 60/109, 892 
<151> 1998-11-25 

<160> 73 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1495 
<212> DNA 

<213> Arabidopsis sp 

60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1495 



<21C> 2 
<211> 433 
<212> PRT 

<213> Arabidopsis sp 
<400> 2 



<400> 1 

atggcgataa ttccgttagc acagcttaat 
tttcttacca aatcgatatc ttctcattcg 
agaatcagtc aattccgtgg cggcttctct 
tcgatgcgat tgaggtgttc cttctctccg 
gtcggtggtg gtggtaacaa tgccgttaac 
gatttctatg cgataaacac ggattcgcaa 
cttcaaattg gagaactttt aactcgtggg 
gaacaagctg cagaagaatc aaaagatgca 
gttttcataa ctgctggtat gggtggtgga 
cagatttcga aggatgctgg ttatttgact 
gaaggacgta aaagatcttt gcaggcactg 
gataccctta tcgtgattcc aaatgatcgt 
cttcaggacg cgtttcttct tgcagatgat 
gatattatta ctatacctgg actagtcaat 
aaagattctg gaactgcaat gctcggggta 
gaagcagctg aacaagcaac tttggctcca 
ggtgtcgtct acaacatcac tggtggaaaa 
tcacaggtcg tgacaagttt ggcagaccca 
gatgatcgct acaccggaga gattcatgta 
ttccagaaga cacttctgac tgatccaaga 
tcaggtcaac aagagaacaa aggaatgtct 
atctctacca aatcgtcttc tccccgtaga 
cggtttcaag catcaaaaat gtaacgatct 
tcaaaaaaaa aaaaaaaagg cggccgctct 
«~rr»<-rrt-r*t-a actcttctat aqtqtcacct 



gagctaacga tttcttcatc ttcttcttcg 
ttgcatagta gctgcatttg cgcaagttct 
aaacgaagaa gcgattcaac aaggtctaag 
atggaatctg cgagaattaa ggtgattggt 
cggatgattt caagcggttt acagagtgtt 
gctctgttac agtcttctgc tgagaaccca 
cttggcactg gtggaaaccc gcttcttgga 
attgctaatg ctcttaaagg atcagacctt 
acagggtctg gtgctgcacc tgtggtagct 
gttggtgttg ttacctatcc gtttagcttt 
gaagctattg aaaagctcca aaagaatgtt 
ccgctagata ttgctgatga acagacgcca 
gttttacgcc aaggagtaca aggaatctca 
gtagattttg cagatgtgaa ggcagtcatg 
ggtgtttctt ccagcaaaaa ccgggcagaa 
ttgatcggat catccataca atcagctact 
gacataactt tgcaggaagt gaaccgagta 
tcggccaaca tcatatttgg agctgttgtg 
acgataatcg ccacaggctt ctctcagtca 
gcagctaaac tccttgacaa aatgggatca 
ctgcctcacc agaagcagtc tccatcaact 
cttttcttct agttctcttt ttttcctttt 
tcaggcccaa atatcaatta catttgattt 
agaggatcca agcttacgta cgcgtgcatg 
aaattcaatt cactggccgt cgttt 
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Met Ala He lie Pro Leu Ala Gin Leu Asn Glu Leu Thr He Ser Ser 

1 5 10 • 

Ser Ser Ser Ser Phe Leu Thr Lys Ser lie Ser Ser His Ser Leu His 

Ser Ser Cys lie Cys Ala Ser Ser Arg lie Ser Gin Phe Arg Gly Gly 

35 40 45 

Phe Ser Lys Arg Arg Ser Asp Ser Thr Arg Ser Lys Ser Met Arg Leu 

Arg Cys Ser Phe Ser Pro Met Glu Ser Ala Arg He Lys Val He Gly 

Val Gly Gly Gly Gly Asn Asr. Ala Val Asn Arg Met He Ser Ser Gly 

85 90 

Leu Gin Ser Val Asp Phe Tyr Ala He Asn Thr Asp Ser Gin Ala Leu 

100 1° 5 

Leu Gin Phe Ser Ala Glu Asn Pro Leu Gin He Gly Glu Leu Leu Thr 

115 120 12b 

Arg Gly Leu Gly Thr Gly Gly Asn Pro Leu Leu Gly Glu Gin Ala Ala 

i-jq "* 135 1^0 

Glu Glu Ser Lys Asp Ala He Ala Asn Ala Leu Lys Gly Ser Asp Leu 

"l Phe He Thr Ala Gly Met Gly Gly Gly Thr Gly Ser Gly Ala Ala 

Pro Val Val Ala Gin He Ser Lys Asp Ala Gly Tyr Leu Thr Val Gly 

180 185 19 

Val Val Thr Tyr Pro Phe Ser Phe Glu Gly Arg Lys Arg Ser Leu Gin 

195 200 203 

Ala Leu Glu Ala He Glu Lys Leu Gin Lys Asn Val Asp Thr Leu He 

Val He Pro Asn Asp Arg Leu Leu Asp He Ala Asp Glu Gin Thr Pro 

III Gin Asp Ala Phe Leu Leu Ala Asp Asp Val Leu Arg Gin Gly Val 

Gin Gly He Ser Asp He He Thr He Pro Gly Leu Val Asn Val Asp 

Phe Ala Asp Val Lys Ala Val Met lys Asp Ser Gly Thr Ala Met Leu 

275 280 aod 

Gly Val Gly Val Ser Ser Ser Lys Asn Arg Ala Glu Glu Ala Ala Glu 

290 295 300 

Gin Ala Thr Leu Ala Pro Leu lie Gly Ser Ser He Gin Ser Ala Thr 

Gly Val Val Tyr Asn lie Thr Gly Gly Lys Asp He Thr Leu Gin Glu 

Val Asn Arg Val III Gin Val Val Thr Ser Leu Ala Asp Pro Ser Ala 

Asn lie He Phe Gly Ala Val Val Asp Asp Arg Tyr Thr Gly Glu He 

355 360 365 

His Val Thr He lie Ala Thr Gly Phe Ser Gin Ser Phe Gin Lys Thr 

370 375 3 

Leu Leu Thr Asp Pro Arg Ala Ala Lys Leu Leu Asp Lys Met Gly Ser 

III Gly Gin Gin Glu Asn Lys Gly Met Ser Leu Pro His Gin Lys Gin 

Ser Pro Ser Thr He Ser Thr Lys Ser Ser Ser Pro Arg Arg Leu Phe 
420 425 430 

Phe 
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<210> 3 
<211> 1611 
<212> DNA 

<213> Arabidopsis sp 



<400> 3 

tgttgttgcc 

aagccaagct 

agggactagt 

ttttgaagag 

gggaggtggt 

gttctggatt 

taggttacaa 

cggtatgaat 

tatggtcttt 

tgcaggaatt 

gtttgagggt 

tgttgacact 

tccggtaaca 

atctgatatc 

aatggcaaat 

aagagatgct 

cactggaatt 

tgctgcggaa 

tgtagatcca 

acaagaagag 

agctacaaga 

cttgaagaag 

ctgcacactg 

tacagcaaga 

ctgcctctgg 

cttccaaaac 

gattatgaga 



gctcagaaat 
caagatcctt 
acaatagtca 
ccatctgctc 
ggatcaaatg 
gtcaacactg 
attggtaagg 
gctgctagag 
gtcacagctg 
gccaaggcga 
cgaagaagaa 
ctcatcgtca 
gaagcattta 
attacgattc 
gcggggtctt 
gcgctaaatg 
gtttggaaca 
gtaatatatg 
gccctcagcg 
ggagaaggac 
agaccctctt 
aaaggcagct 
cagcaataac 
tgtgttgatg 
tcgtgttctt 
aaaaaaggag 
gttttagcag 



ctgaatcttc 
tcttgaacct 
atccaagaaa 
cgagtaacta 
ctgtgaatcg 
atatccaggc 
agttgactag 
agagcaaaga 
gaatgggcgg 
tgggtatatt 
ctgttcaggc 
ttccaaatga 
atctagctga 
ctggtttggt 
cattgatggg 
caatccaatc 
ttactggcgg 
atcttgtcga 
gtcaagtaag 
gaacagttca 
cttcctttag 
ctcgttatcc 
aaacgtgtgt 
tatgatcaag 
gaataggttg 
aagaagaatc 
aaattattgt 



tccaatcaga 
tcacccggaa 
ggaaacgtct 
caatgaggcg 
tatgatagag 
tatgagaatg 
gggtttaggt 
agttattgaa 
tggaactggc 
gacagttggt 
tcaagaaggg 
caagttgctt 
tgatatactc 
gaatgtggat 
aataggaact 
ccctttgtta 
aagtgacttg 
tccaactgcc 
cataaccctg 
gatggtacaa 
agaaagcggt 
ccgagtctaa 
gtactggtag 
aatctgtgtg 
ttttagaaat 
acacttctcg 
caaaaaaaaa 



aactctccac 
atatctatgc 
tctggaccrg 
aggattaagg 
agtgaaatgt 
tctcctgttt 
gctggaggaa 
gaagctcttt 
actggtgcag 
attgccacaa 
cttgcatctc 
acagctgtct 
cgtcaggggg 
tttgctgatg 
gcgacaggaa 
gatattggga 
acattgtttg 
aatcttatat 
atagctacgg 
gcagatgctg 
tcagtggaga 
agcccaatct 
tctggtaccg 
ggtgtgtaca 
cggagtctct 
aaccataaac 
aaaaaaaaaa 



ggcattacca 
ttagaggtga 
ttgtcgagga 
ttattggtgt 
caggtgtgga 
tgcctgataa 
atccagaaat 
atggctcaga 
cccctgtaat 
cgcctttctc 
tcagagacaa 
ctcagtctac 
ttcgtgggat 
tgagagctat 
agagtcgggc 
ttgagagagc 
aggtaaatgc 
tcggtgctgt 
gtttcaaacg 
cgtcagttgg 
tcccagagtt 
aatcactacc 
ccttctggga 
tgrtctgtca 
ctctatgtca 
atacttataa 
a 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1611 



<210> 4 

<211> 397 

<212> PRT 

<213> Arabidopsis Sp 



<400> 4 














Met 


Leu 


Arg 


Gly 


Glu 


Gly 


Thr 


Ser 


1 








5 








Thr 


Ser 


Ser 


Gly 

20 


Pro 


Val 


Val 


Glu 


Ser 


Asn 


Tyr 
35 


Asn 


Glu 


Ala 


Arg 


lie 
40 


Gly 


Ser 


Asn 


Ala 


Val 


Asn 


Arg 


Met 


50 










55 




Glu 


Phe 


Trp 


Xle 


Val 


Asn 


Thr 


Asp 


65 








70 






Val 


Leu 


Pro 


Asp 


Asn 
85 


Arg 


Leu 


Gin 


Leu 


Gly 


Ala 


Gly 


Gly 


Asn 


Pro 


Glu 



Thr 


He 


Val 


Asn 


Pro 


Arg Lys 


Glu 




10 








15 




Asp 


Phe 


Glu 


Glu 


Pro 


Ser Ala 


Pro 


25 










30 


Gly 


Lys 


Val 


He 


Gly 


Val 


Gly Gly 








45 






lie 


Glu 


Ser 


Glu 


Met 


Ser Gly Val 








60 








He 


Gin 


Ala 


Met 


Arg 


Met Ser 


Pro 




75 








80 


He 


Gly 


Lys 


Glu 


Leu 


Thr Arg 


Gly 




90 








95 


Glu 


He Gly 


Met 


Asn 


Ala 


Ala Arg 
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105 










110 






Leu 


Tyr 


Gly 


Ser 


Asp 


Met 


Val 


Phe 








125 








Thr 


Gly 


Thr 


Gly Ala 


Ala 


Pro 


Val 






140 










Gly 


He 


Leu 


Thr 


Val 


Gly 


He 


Ala 




155 










160 


Arg 


Arg 


Arg 


Thr 


Val 


Gin 


Ala 


Gin 


170 








175 




Asn 


Val 


Asp 


Thr 


Leu 


He 


Val 


He 


185 








190 






vai 


Ser 


Gin 


Ser 


Thr 


Pro 


Val 


Thr 










205 








He 


Leu Arg Gin Gly 


Val 


Arg 


Gly 








220 










Gly 


Leu 


Val 


Asn 


Val 


Asp 


Phe 


Ala 




235 










240 


Ala 


Gly 


Ser 


Ser 


Leu 


Met Gly 


He 




250 










255 




Ala 


Arg 


Asp 


Ala 


Ala 


Leu 


Asn 


Ala 


265 








270 




lie 


Gly 


He 


Glu 


Arg 


Ala 
285 


Thr Gly 


Asp 


Leu 


Thr 


Leu 


Phe 


Glu 


Val 


Asn 






300 










Leu 


Val 


Asd 


Pro 


Thr 


Ala 


Asn 


Leu 




315 










320 


Ala 


Leu 


Ser Gly Gin 


Val 


Ser 


He 


330 










335 




Arg 


Gin 


Glu 


Glu 


Gly 


Glu Gly 


Arg 


345 










350 






Ala 


Ala 


Ser 


Val 


Gly 


Ala 


Thr 


Arg 










365 








Ser 


Gly 


Ser 


Val 


Glu 


He 


Pro 


Glu 








380 










Arg 


Tyr 


Pro 


Arg 


Val 









100 

Ser Lys Glu Val He Glu Glu Ala 
115 120 
Val Thr Ala Gly Met Gly Gly Gly 

130 135 
He Ala Gly He Ala Lys Ala Met 
145 150 
Thr Thr Pro Phe Ser Phe Glu Gly 
165 

Glu Gly Leu Ala Ser Leu Arg Asp 
180 

Pro Asn Asp Lys Leu Leu Thr Ala 
195 200 
Glu Ala Phe Asn Leu Ala Asp Asp 

210 215 
He Ser Asp He He Thr He Pro 
225 230 
Asp Val Arg Ala He Met Ala Asn 
245 

Gly Thr Ala Thr Gly Lys Ser Arg 
260 

He Gin Ser Pro Leu Leu Asp He 
275 ' 280 
Val Trp Asn He Thr Gly Gly Ser 

290 295 
Ala Ala Ala Glu Val He Tyr Asp 
305 310 
He Phe Gly Ala Val Val Asp Pro 
325 

Thr Leu lie Ala Thr Gly Phe Lys 
340 

Thr Val Gin Met Val Gin Ala Asp 
355 360 
Arg Pro Ser Ser Ser Phe Arg Glu 

370 375 
Phe Leu Lys Lys Lys Gly Ser Ser 
385 390 395 

<210> 5 
<211> 1450 
<212> DNA 
<213> Brassica sp 

<400> 5 

atggcgatta gtccgttggc acagcttaac gagctaccag tctcttcctc gtttcttgcg 
acatcccact cgctgcacag taccagaatc agtggcggct tctcaaaaca aaggtttaag 
caaacacggt tgagatgctc cttctctccg atggagtctg cgaggattaa ggtggttggt 
gtcggcggtg gtggtaacaa tgccgtcaat cgcatgattt ccagcggctt acagagtgtt 
gatttctatg cgataaacac ggactctcaa gctctcttgc agtcttctgc gcagaaccct 
cttcaaattg gagagctcct aactcgtggc cttgggactg gtgggaaccc gcttctagga 
gaacaagctg ctgaggaatc taaagacgcg attgctaatg ctcttaaagg atctgacctt 
gytttcatta ctgctggtat gggtggtggc actggctccg gtgctgctcc tgttgttgct 
cagatctcga aagacgctgg ttatttgacc gttggtgttg ttacctatcc cttcagcttc 540 
gaaggtcgta aaagatcttt gcaggcactt gaagccattg aaaagctgca gaagaacgtg 600 
gataccctca tcgtgatacc aaatgatcgt ctcctagata ttgctgatga acagacgcct 660 



60 
120 
180 
240 
300 
360 
420 
480 
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cttcaagacg 
gatattatta 
aaagattccg 
gaagcagctg 
ggtgtcgtct 
tctcaggtgg 
gatgatcgat 
ttccagaaga 
tcaggtcaac 
accaaatcat 
cctttatcaa 
ctcaaaagaa 
ccaagcttac 
attcactggc 



cttttcttct 
ctatacctgg 
gaactgcgat 
agcaagccac 
acaacatcac 
tgacaagttt 
acactggaga 
cacttctcag 
aacaagagaa 
cttctccccg 
aaatgtaacg 
gtaatttgtt 
gtacgcgtgc 



cgccgatgat 
actggtcaat 
gctcggggtg 
tttggctcca 
cggtggaaaa 
ggcagaccca 
gattcatgta 
tgatccaaga 
caaaggcagt 
tagattgttc 
atcttcaggc 
aaaccaaaaa 
atgcgacgtc 



gttttgcggc 
gtagattttg 
ggtgtttctt 
ttgatcggat 
gacattactt 
tcggccaaca 
acgataatag 
gcagctaaac 
caccagaggc 
ttctagtatc 
tcaaatatca 
aaaaaaaaaa 
atagctcttc 



aaggagttoa 
cggatgtgaa 
caagcaagaa 
catccattca 
tgcaggaagt 
tcatatttgg 
ccacggggtt 
tactcgacaa 
agtctcctgc 
ttttgttttt 
attacttttc 
gggcggccgc 
tatagtgtca 



aggaatctct 
gtcggttatg 
ccgagcagaa 
atcagctact 
gaaccgagta 
agctgttgtg 
ctcacagtct 
aacgggatca 
aactatcaac 
taagcatatt 
tccagattat 
tctagaggat 
cctaaattca 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1450 



<210> 6 

<211> 411 

<212> PRT 

<213> Brassica sp 



<220> 

<221> VARIANT 

<222> (1) . . . (411) 

<223> Xaa = Any Amino Acid 



<400> 6 












Met 


Ala 


He Ser 


Pro 


Leu 


Ala 


Gin 


1 






5 








Ser 


Phe 


Leu Ala 


Thr 


Ser 


His 


Ser 






20 










Gly 


Phe 


Ser Lys 


Gin 


Arg 


Phe 


Lys 




35 








40 


Ser 


Pro 


Met Glu 


Ser 


Ala 


Arg 


He 




50 








55 




Gly 


Asn 


Asn Ala 


Val 


Asn 


Arg 


Met 


65 








70 






Asp 


Phe 


Tyr Ala 


He 


Asn 


Thr 


Asp 






85 








Ala 


Gin 


Asn Pro 


Leu 


Gin 


He 


Gly 






100 










Thr 


Gly 


Gly Asn 


Pro 


Leu 


Leu 


Gly 




115 








120 


Asp Ala 


He Ala 


Asn 


Ala 


Leu 


Lys 




130 








135 




Ala 


Gly 


Met Gly Gly 


Gly 


Thr 


Gly 


145 








150 






Gin 


He 


Ser Lys 


Asp 


Ala 


Gly 


Tyr 








165 








Pro 


Phe 


Ser Phe 


Glu 


Gly Arg 


Lys 






180 










He 


Glu 


Lys Leu 


Gin 


Lys 


Asn 


Val 






195 








200 


Asp 


Arg 


Leu Leu Asp 


He 


Ala 


Asp 




210 








215 




Phe 


Leu 


Leu Ala 


Asp 


Asp 


Val 


Leu 



Leu 


Asn 


Glu 


Leu 


Pro 


Val 


Ser 


Ser 


10 










15 




Leu 


His 


Ser 


Thr 


Arg 


He 


Ser 


Gly 


25 










30 




Phe 


Gin 


Thr 


Arg 


Leu 


Arg 


Cys 


Ser 








45 








Lys 


Val 


Val 


Gly 


Val 


Gly 


Gly 


Gly 


He 


Ser 


O V 

Ser Gly 


Leu 


Gin 


Ser 


Val 




75 










80 


Ser 


Gin 


Ala 


Leu 


Leu 


Gin 


Ser 


Ser 


90 










95 


Gly 


Glu 


Leu 


Leu 


Thr 


Arg 


Gly 


Leu 


105 










110 






Glu 


Gin 


Ala 


Ala 


Glu 


Glu 


Ser 


Lys 








125 








Gly 


Ser 


Asp 


Leu 


Xaa 


Phe 


He 


Thr 






140 










Ser 


Gly 


Ala 


Ala 


Pro 


Val 


Val 


Ala 




155 










160 


Leu 


Thr 


Val 


Gly 


Val 


Val 


Thr 


Tyr 


170 










175 


Ala 


Arg 


Ser 


Leu 


Gin 


Ala 


Leu 


Glu 


185 










190 






Asp 


Thr 


Leu 


He 


Val 


He 


Pro 


Asn 








205 








Glu 


Gin 


Thr 


Pro 


Leu 


Gin 


Asp 


Ala 






220 










Arg 


Gin 


Gly Val 


Gin Gly 


He 


Ser 
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225 










Tin 










Asp 


He 


He 


Thr 


lie 


Pro 




Leu 


Val 


Asn 








245 










250 


Lys 


Ser 


Val 


Met 


Lys 


Asp 


Ser 




Thr 


Ala 






260 










2 65 




Ser 


Ser 


Ser 


Lys 


Asn 


Arg 


Aid 


Glu 


Glu 


Ala 






275 










2 80 






Ala 


Pro 


Leu 


He 


Gly 


Cor 


Ser 


He 


Gin 


Ser 




290 










2 95 








Asn 


He 


Thr Gly Gly 


Lys 


Asp 


He 


Thr 


Leu 


305 










ti n 
Jlw 










Ser 


Gin 


Val 


Val 


Thr 


Ser 


Leu 


Ala 


Asp 


Pro 










325 










330 


Gly Ala 


Val 


Val 


Asp 


Asp 


Arg 


Tyr 


Thr 


Gly 








340 










345 




He 


Ala 


Thr Gly Phe 


Ser 


Gin 


Ser 


Phe 


Gin 






355 










360 






Pro 


Arg 


Ala 


Ala 


Lys 


Leu 


Leu 


Asp 


Lys 


Thr 




370 










375 








Gin 


Glu 


Asn 


Lys 


Gly 


Ser 


His 


Gin 


Arg 


Gin 


385 










390 








Phe 


Thr 


Lys 


Ser 


Ser 


Ser 


Pro 


Arg 


Arg 


Leu 








405 










410 



235 

Val Asp 

Met Leu 

Ala Glu 

Ala Thr 
300 
Gin Glu 
315 

Ser Ala 

Glu He 

Lys Thr 

Gly Ser 
380 
Ser Pro 
395 
Phe 



Phe Ala 

Gly Val 
270 
Gin Ala 
285 

Gly Val 

Val Asn 

Asn He 

His Val 
350 
Leu Leu 
365 

Ser Gly 
Ala Thr 



240 
Asp Val 
255 

Gly Val 

Thr Leu 

Val Tyr 

Arg Val 
320 
He Phe 
335 

Thr He 

Ser Asp 

Gin Gin 

lie Asn 
400 



<210> 7 
<211> 1295 
<212> DNA 

<213> Nicotiana sp 



<400> 7 

tgccgttaac 

ggatgctcaa 

gactcgtggg 

gaaggaagcc 

gggtggaggt 

ctatttgact 

gcaggctctg 

aatgaccgtc 

gctgatgatg 

cttgtaaatg 

cttggagttg 

cttgcccctc 

ggaggaaaag 

gctgatccct 

aatacacgtg 

tgacccacga 

acctgttacc 

gctgttcttt 

ttttcttttt 

tagacatagt 

aaaaaaagta 

cactggcggc 

<210> B 
<211> 1255 



cggatgattt 
gcactgctac 
cttggtactg 
at tgcaaatt 
acaggatctg 
gttggtgttg 
gaagcaattg 
tgctagatat 
tattacgcca 
tggattttgc 
gggtttcatc 
ttaattggat 
accataactt 
cccgctaaca 
accataattg 
ggtgcaaagc 
ctgaggtcat 
tagctccttt 
tactttcttt 
gatgattctt 
ctctgcgttg 
cgctcgagca 



caagcggttt 
agtctgctgc 
gtggtaa tec 
ctctaaaagg 
gtgctgctcc 
tcacataccc 
aaaaacttca 
tgctgatgag 
aggtgtccaa 
cgatgtaaag 
aagcaagaac 
cgtccattca 
tgcagaaagt 
tcatatttgg 
caactggttt 
ttgttgataa 
caacctcacc 
atatagtttg 
gtatttacat 
ateaagtgea 
ttacccactg 
tgcatctaga 



acagggtgtt 
tgaaaacccg 
tcttttaggg 
ttcagatatg 
tgttgtggct 
attcagcttt 
gaaaaatgta 
cagacaccac 
ggaatttccg 
gtagtgatga 
cgtgctgaag 
atcgccactg 
gaatagggtg 
tgctgttgtg 
tacccagtct 
aggeccagta 
ttcgacaaca 
ttaeggctte 
gttttgctga 
tcacattcat 
ttaagggega 
gggee 



gacttctatg 
cttcaaattg 
gaacaggcag 
gtgttcataa 
caaatagcaa 
gaaggacgta 
gataccctta 
ttcaagatgc 
atataattac 
aagattctgg 
aagcagcega 
gggtagtatc 
tctcaggttg 
gatgageget 
tttcagaaga 
atccaagaaa 
tcacgaacac 
atttttctct 
ttggtgtttg 
actcgaaaaa 
attctgeaga 



ctataaacac 
gagaacttct 
tggaggagtc 
cagcaggaat 
aagaagcagg 
aaagatccgt 
tagtaattcc 
ttttcttctt 
tatacctggg 
aactgetatg 
acaagcaact 
caccattcca 
ttacagtctg 
acaatggega 
ctcttctctc 
gcatggcatc 
etacteggag 
tttcttactt 
catttggctg 
aaaaaaaaaa 
tatcccatca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1295 
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<212> DNA 

<213> Nicotiana sp 



<400> 8 

atggccacca 

cactcttcct 

aaacctcaac 

gttatcggcg 

cagggtgttg 

gaaaacccgc 

cttttagggg 

tcagatatgg 

gttgtggctc 

ttcagctttg 

aaaaatgtag 

cagacaccac 

ggaatttccg 

gtagtgatga 

cgtgctgaag 

tcagccactg 

aatagggtgt 

gctgttgtgg 

acccagtctt 

ggcccagtaa 

tcgacaacat 



tctcaaaccc 
ttattcctaa 
gtttcagcat 
tcggtggcgg 
acttctatgc 
ttcaaattgg 
aacaggcagc 
tgttcataac 
aaatagcaaa 
aaggacgtaa 
atacccttat 
ttcaagatgc 
atataattac 
aagattctgg 
aagcagccga 
gggtagtatc 
ctcaggttgt 
atgagcgcta 
ttcagaagac 
tccaagaaag 
cacgaacacc 



agcagagata 
acaatgctgc 
ttcaagttca 
tggtaacaat 
tataaacacg 
agaacttctg 
ggaggagtcg 
agcaggaatg 
agaagcaggc 
aagatccgtg 
agtaattccc 
ttttcttctt 
tatacctggg 
aactgctatg 
acaagcaac t 
caccattcca 
tacagtctgg 
caatggcgaa 
tcttctctct 
catggcatca 
tactcggagg 



gcagcttctt 
ttcaccaaag 
tttactcctt 
gccgttaacc 
gatgctcaag 
actcgtgggc 
aaggaagcca 
ggtggaggta 
tatttgactg 
caggctctgg 
aatgaccgtc 
gctgatgatg 
cttgtaaatg 
cttggagttg 
cttgcccctc 
ggaggaaaag 
ctgatccctc 
atacacgtga 
gacccacgag 
cctgttaccc 
ctgttctttt 



ctccttcctt 
ctcgccggaa 
ttgattctgc 
ggatgatttc 
cactgctgca 
ttggtactgg 
ttgcaaattc 
caggatctgg 
ttggtgttgt 
aagcaattga 
tgctagatat 
tattacgcca 
tggattttgc 
gggtttcatc 
ttattggatc 
acataacttt 
ccgctaacat 
ccacaattgc 
gtgcaaagct 
tgaggtcatc 
agctcettta 



tgctttttac 
aagcttatgt 
taagattaag 
aagcggttta 
gtctgctgct 
tggtaatcct 
tctaaaaggt 
tgctgctcct 
cacataccca 
aaaacttcag 
tgctgatgag 
aggtgtccaa 
cgatgtaaag 
aagcaagaac 
gtccattcaa 
gcagaaagtg 
catatttggt 
aactggtttt 
tgttgataaa 
aacctcacct 
tatag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1255 



<210> 9 

<211> 413 

<212> PRT 

<213> Nicotiana sp 



<400> 9 






Met 


Ala 


Thr 


He 


1 








Phe 


Ala 


Phe 


Tyr 








20 


Lys 


Ala 


Arg 


Arg 






35 




Ser 


Ser 


Phe 


Thr 




50 






Gly Gly 


Gly 


Gly 


65 








Gin Gly 


Val 


Asp 


Gin 


Ser 


Ala 


Ala 








100 


Gly 


Leu 


Gly 


Thr 






115 




Glu 


Ser 


Lys 


Glu 




130 






Phe 


lie 


Thr 


Ala 


145 








Val 


Val 


Ala 


Gin 


Val 


Thr 


Tyr 


Pro 



Ser 


Asn 


Pro 


Ala 


5 








His 


Ser 


Ser 


Phe 


Lys 


Ser 


Leu 


Cys 






40 


Pro 


Phe 


Asp 


Ser 






55 




Asn 


Asn 


Ala 


Val 




70 






Phe 


Tyr 


Ala 


He 


85 








Glu 


Asn 


Pro 


Leu 


Gly 


Gly 


Asn 


Pro 








120 


Ala 


He 


Ala 


Asn 






135 




Gly 


Met 


Gly 


Gly 




150 






He 


Ala 


Lys 


Glu 


165 








Phe 


Ser 


Phe 


Glu 



Glu 


lie 


Ala 


Ala 




10 






He 


Pro 


Lys 


Gin 


25 








Lys 


Pro 


Gin Arg 


Ala 


Lys 


He 


Lys 






60 


Asn 


Arg 


Met 


lie 






75 




Asn 


Thr 


Asp 


Ala 




90 






Gin 


He 


Gly 


Glu 


105 








Leu 


Leu 


Gly Glu 


Ser 


Leu 


Lys 


Gly 








140 


Gly 


Thr 


Gly 


Ser 




155 




Ala 


Gly 


Tyr 


Leu 




170 






Gly 


Arg 


Lys 


Arg 



Ser 


Ser 


Pro 


Ser 






15 




Cys 


Cys 


Phe 


Thr 




30 






Phe 


Ser 


He 


Ser 


45 








Val 


He 


Gly 


Val 


Ser 


Ser 


Gly 


Leu 








80 


Gin 


Ala 


Leu 


Leu 






95 




Leu 


Leu 


Thr 


Arg 




110 






Gin 


Ala 


Ala 


Glu 


125 








Ser 


Asp 


Met 


Val 


Gly Ala 


Ala 


Pro 








160 


Thr 


Val 


Gly 


Val 






175 




Ser 


Val 


Gin 


Ala 
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130 



Leu 


C2~\ ii 
blu 


Ala 


He 


Glu 




Leu 


Gin 






195 










2C0 


iic 


Pro 


Asn 


Asp 


Arg 


Leu 


Leu 


Asp 




210 










215 




Gin 


Asp 


Ala 


Phe 


Leu 


Leu 


Ala 


Asp 


225 








230 






Gly 


He 


Ser 


Asp 


He 


lie 


Thr 


lie 








245 








Ala 


Asp 


Val 


Lys 


Val 


Val 


Met 


Lvs 








26C 










Val 


Gly 


Val 


Ser 


Ser 


Ser 


Lys 


Asn 




27 5 










280 


Ala 


Thr 


Leu 


Ala 


Pro 


Leu 


He 


Gly 




290 










295 




Val 


Val 


Ser 


Thr 


He 


Pro 


Gly 


Gly 


305 










310 






Asn Arg 


Val 


Ser 


Gin 


Val 


Val 


Thr 










325 








He 


He 


Phe 


Gly Ala 


Val 


Val 


Asp 








340 










Val 


Thr 


He 


He 


Ala 


Thr 


Gly 


Phe 






355 










360 


Leu 


Ser 


Asp 


Pro 


Arg 


Gly 


Ala 


Lys 




370 








375 




Gin 


Glu 


Ser 


Met 


Ala 


Ser 


Pro 


Val 


385 










390 






Ser 


Thr 


Thr 


Ser 


Arg 


Thr 


Pro 


Thr 



405 



185 










190 




Val 


Lvs 


Asn 


Val 


Asp 


Thr 


Leu 


lie 








205 








lie 


Ala 


Asp 


Glu 
220 


Gin 


Thr 


Pro 


Leu 


Asp 


Val 


Leu 


Arg 


Gin Gly Val 


Gin 




235 










240 


Pro 


Gly 


Leu 


Val 


Asn 


Val 


Asp 


Phe 




250 










255 


Gly 


Asp 


Ser 


Gly 


Thr 


Ala 


Met 


Leu 


265 










27C 






Arg 


Ala 


Glu 


Glu 


Ala 


Ala 


Glu 


Gin 








285 








Ser 


Ser 


He 


Gin 
300 


Ser 


Ala 


Thr 


Gly 


Lys 


Asp 


lie 


Thr 


Leu 


Gin 


Lys 


Val 




315 










320 


Val 


Trp 


Leu 


lie 


Pro 


Pro 


Ala 


Asn 




330 










335 


His 


Glu 


Arg 


Tyr 


Asn Gly Glu 


He 


345 






350 






Thr 


Gin 


Ser 


Phe 


Gin 
365 


Lys 


Thr 


Leu 


Leu 


Val 


Asp 


Lys 
380 


Gly 


Pro 


Val 


He 


Thr 


Leu 


Arg 
395 


Ser 


Ser 


Thr 


Ser 


Pro 
400 


Arg 


Arg 
410 


Leu 


Phe 


Phe 









<210> 10 
<211> 1278 
<212> DNA 
<213> Zea mays 



<220> 

<221> misc_feature 
<222> (1) - . . (1278) 
<223> n = A,T,C or G 



<400> 10 

gatcttgtct 

gttgcccaga 

agcttcgagg 

agtgtagaca 

atgcccttgc 

atatcagaca 

gtcatgaaaa 

gcccaagaag 

gctactggcg 

aaggtgtccc 

gtcgttgatg 

cagtccttcc 

gagaaagcag 

tctgcttggt 



tcataacagc 
tatcaaagga 
gccgtaagcg 
cacttantgt 
aagatgcatt 
tcatcacaat 
actctggaac 
ctgctgaaca 
ttgtgtataa 
agattgtgac 
accgttacac 
agaaatccct 
caaccctcgc 
ctcgaagact 



tgggacggga 
agctggttat 
ctctgtacag 
gattccaaat 
tctccttgca 
accgggactt 
tgccatgctc 
ggcaacactt 
tattactggt 
aagcctagct 
tggtgagata 
tttggcggat 
ccataaagca 
cttctcctga 



gggggtactg 
cttactgttg 
gcattggaag 
gataagttat 
gatgatgtcc 
gtcaatgttg 
ggtgttggtg 
gctcctttga 
gggaaggaca 
gacccatctg 
catgtgacaa 
ccaaagggag 
gcagcagctg 
gaagctcatt 



gatctggtgc 
gtgttgtcac 
cactagagaa 
tagatgttgc 
ttcgtcaggg 
attttgctga 
tttcttccag 
ttggatcatc 
tcactttgca 
cgaacataat 
tcattgcgac 
cacgtatagt 
cagttcaacc 
tggtgaaccg 



tgctccagtt 
ctatccattc 
gctggaaaag 
ggatgaaaac 
tgttcaagga 
tgtaaaagct 
caaaaatcgg 
catcgaggca 
agaagtgaac 
tttcggtgct 
aggatttcca 
ggaatccaaa 
ggtccctgct 
tgactcgtag 



60 
120 
180 
240 
300 
360 
42 0 
480 
540 
600 
660 
720 
780 
840 
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900 



rgcattagat ttgcatttag cgtgctgagg gcagtcccta aggtgatctt cggatatctg 
gagatttata gcttgggcta gtgttcggta gtggtagaat aagtttcagt gtatgtatcg 960 
itgctttgct ttatgttttt gaggatcagg cggtgaggct gagagaagtg ctcagcaact 1020 
caacatLgaa ctgttgraga agatctttga ttgcttttat tgctgcaaca tgccaacwc 
cctctgttgg attcamcmna aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaanncaaa aaaaaaaaaa aaaaaaaagg gcggccgccg 
actagtgagc tcgtcgaccc gggaattaat tccggaccgg tacctgcagg cgtaccagct 



ttccctatag tgagtcgt 

<210> 11 

<211> 283 

<212> DNA 

<213> Zea mays 

<400> 11 



gctataaaca ccgattccca agcccttatt aattcacaag cgcaatatcc tctgcaaatt 
ggagagcagt tgacccgcgg cttaggtgcc ggtggaaatc cgaatttggg Wjcaggct 
gctgaggaat caagagaaac catagccact gccctgaggg attcagatct tgtcttcata 
acagctggga tgggaggggg tactggatct ggtgctgetc cagttgttgc ccagatacca 
aaggaagctg gttatcttac tgttggtgtt gtcacctatc cattcagttt cgagggccgt 
aagcgctctg tacaggcaaa gtatctgagc cccccttcac tcctgaattt taattcaaac 
tgtcatatct cgttctgcga ctttcttttg ctcgatggaa gcattagttt gtagtcataa 
caatgacacc cagccacatt tattgctgat gatgtataca atggtaggtc aaagaaatgt 
agcatcatgc catcacctgt agttcatctc atcattttgt tcctactttt ctgcgtggtt 
gatgcccaaa acaatataca actatgtggt tgtactgCtg cattgccttg tggagggatg 
tttatgttgt gaaatatttc aaaacacatg tcatcatgaa tattccctcc tgtggttgtg 



1020 
1080 
1140 
1200 
1260 
1278 



60 
120 



cctccagttg ttgcccagat. atcaaaggaa gctggttatc ttactgttgg tgttgtcacc 
tatccattca gtttcgaggg ccgtaagcgc tctgtacagg cattggaagc ^tagagaag 
ctggaaaaga gtgtagacac acttattgtg attccaaatg ataagttatt agatgttgcg 180 
gafgaaaaca tgcccttgca agatgcattt ctccttgcag atgatgtcct tcgtcagggt 240 
gttcaaggaa tatcagacat catcacaata ccgggacttg tea 

<210> 12 
<211> 287 
<212> DNA 
<213> Zea mays 

<220> 

<221> misc_f eature 
<222> (1) . . . (287) 
<223> n = A,T,C or G 

gggccgtaag cgctctgtac aggcattgga agcactagag aagctggaaa agagtgtaga 60 
cacacttatt gtgattccan atnatnngtt attagatgtt gcggatgaaa acatgccctt 120 
geaagatgea tttctccttg cagatgatgt ccttcgtcag ggtgttcaag S^tatcaga 180 
catcatcaca atacegggae ttgtcaatgt tgattttget gatgtaaaag ctgtcatgaa 240 
aaactctgga actgccatgc tcggtgttgg tgtttcttcc agcaaaa 

<210> 13 
<211> 1122 
<212> DNA 
<213> Zea mays 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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gggacttgtt 
cgtctttcgt 
gagagtataa 
acataaaata 
cttaatatat 
aaaaatgtga 
ataggcacag 
ttgacttaga 



tcaaatgcra 
ccatgaaaca 
ttaccaagct 
aagggtataa 
tgaatcattg 
ttgcaagtta 
gacacagcct 
cctttatgga 



tgaattaaga 
ttattccctt 
taaatcttcg 
aaaggggtaa 
aatacaataa 
ccagaatgcg 
gtggaggaat 
ctaaaagatc 



acaaggcaac 
gaggataacg 
taacaaaatt 
ataaatca~a 
tacctctgcc 
tgaacagtaa 
tcaattatac 
attgctatct 



acaaagtgtt 
ggcctcggac 
tcaatagata 
gacgaattat. 
ttggcaaagg 
aggaatactg 
ccgtcataag 
tt 



aaatgttaac 
aaaggctgat 
ttgtaagata 
atcatattta 
ttggattccg 
ttcactattt 
agtttacaca 



720 
780 
840 
900 
960 
1020 
1080 
1122 



<210> 14 
<211> 291 
<212> DNA 
<213> Zea mays 



<220> 

<221> misc_f eature 
<222> (1) - . . (291) 
<223> n = A,T,C or G 



<400> 14 

aaaatagtgt 

caaatacacc 

gtggcatatc 

gtgctatcat 

caagagcaag 



ggacacccta atcgtcatcc caaatgataa gttgctgtct gctgtttctc 
tgtaactgaa gcatttaatc tggctgatga tattcttcgt caaggcattc 
tgatataatt acggttcctg ggnaggttaa tgttgatttt gctgacgtac 
gcaaaatgca gggtcatcct tgatgggtat agggactgct acaggaaagt 
ggatgctgct cttaacgcca tccagtcgcc gctgcttgat a 



60 
120 
180 
240 
291 



<210> 15 
<211> 415 
<212> DNA 
<213> Zea mays 



<220> 

<221> misc_feature 
<222> (1) . . . (415) 
<223> n = A,T,C or G 



<400> 15 

gagcaaggga tgctgctctt aacgccatcc agtcgccgcC gcttgatatt ggaattgaaa 



60 



gagccacagg cattgtgtgg aatatcactg ggggaactga cctgactttg tttgaggtga 12U 
atgctgcggc cgaaattatc tacgaccttg tcgatccaaa cgctaatctg atatttggcg 
ccgtcacaga cccgtcactg agtgggcagg tgagcataac cttgatagct actggctnca 
aacggcagga tgaaccagaa ggccgcgtgt cgaagggtgg gcaacaaggt gagaatggcc 
gacgcccatc cccagcanag ggcaacaaca cggtggaaat tccaaaattc ccgccaacaa 
aaagggccct tccnncttcc cacnattttg actggtcctg tctgcacctg tatga 415 



180 
240 
300 
360 



<210> 16 

<211> 744 

<212> DNA 

<213> Zea mays 



<220> 

<221> misc_feature 
<222> (1) . . . (744) 
<223> n = A,T,C or G 
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<400> 16 

aattcccggg 

gatctaacct 

ggtgcaaatc 

actctaattg 

ggaggagaca 

catgcattgc 
gaacacactt 
tgggtctctt 
tgcaaacttc 
gtgaggtctc 
gttccgtgtg 
aaaaaggcgg 
ntatagngtn 



tcgacccacg 
tgacagaggt 
tgatttttgg 
caactggttt 
agcaaccgcg 
aaatcccana 
tggatcaatg 
aggtagcaag 
ngccgctgag 
aatttgctag 
aatgattact 
cgctntanag 
acctaaattc 



cgtcccgcgg 
gaatgctgca 
ctctgttata 
caaacgccag 
gtcgctcggc 
gttcctacag 
tttttcttgt 
gtagaacaga 
taccattggc 
ttagtatgat 
tgctcnctgc 
gatccaagct 
aatc 



acgcgtgggt 
gctgaagtaa 
gatccgtcat 
gaggaaagtg 
tggttttctc 
aggaaagggc 
catagtttgg 
tgttcctgaa 
ttgggctgct 
taaaagtnaa 
cattttcttt 
tacttccccc 



ggaatatcac 
tctatgatct 
acactggtca 
agagccggtc 
ccacttccca 
gtccagggtt 
tacgatgcag 
cccgcacata 
ttgcttctca 
gcgctgagac 
tcaaaaaaaa 
gcatncgacn 



tggagggaac 
tgttgaccct 
agtgagcata 
ttcacaggct 
ggaggaaggt 
tcacgagtct 
gtttggtttc 
ctaatctgtg 
ngaacctgca 
caaattatac 
aaaaaaaaaa 
canagctntt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
744 



<210> 17 
<211> 230 
<212> DNA 
<213> Zea mays 



<220> 

<221> misc — feature 
<222> (1) ... (230) 
<223> n = A,T,C or G 



<400> 17 

ggctgctgag gaatcaagag aaaccatagc cactgccctg agggattcag atcttgtttt 

cataacagct gggatgnnag ggggtgctgc tccaattgtt gcccagatat caaaggaagc 

tggttatctt actgttggtg ttgtcaccta tccattcaat ttcgagggcc gtaagcgctc 

tttacaggca agtatctgag ccccccttca ctcctgaatt agaattcaaa 



60 
120 
180 
230 



<210> 18 
<211> 318 
<212> DNA 
<213> Zea mays 



<220> 

<221> misc_feature 
<222> (1) . . . (318) 
<223> n = A,T,C or G 



<400> 18 

caggcattgt 

cggccgaaat 

tagacccgtc 

ggatgaacca 

gtcccccgca 

ctcgcttccc 



gtggaatatc 
tatctacgac 
actgagtggg 
gaaggccgcg 
gagggcagca 
agagttga 



actgggggaa 
cttgtcgatc 
caggtgagca 
tgtcgaaggg 
gcacggtgga 



ctgacctaac 
caaatgctaa 
taacctgata 
tgggcaacaa 
gttccagagt 



tttgtttgag 
tctgatattt 
gctactggct 
agtgagaatg 
cctgcgacgt 



gtgaatgctg 
ggtgccgtca 
tcaaacggca 
gccgacgccc 

agagganctt 



60 
120 
180 
240 
300 
318 



<210> 19 
<211> 471 
<212> DNA 
<213> Zea mays 



<220> 
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<221> misc_feature 
<222> (1) - - • (471) 
<223> n = A.T.C or G 



<400> 19 

cgacgcccaa 

anacattacc 

natatttact 

ctctcacaca 

gtgagctttt 

gtgttagtcg 

ctggtgatta 

aaaaaaaaaa 



ggtgacgaat 
tgcc tcactc 
cctgctgctg 
gcatgcgccg 
gttcgagtga 
tctcagggaa 
aataanaaag 
aaaaaaaaag 



gctgtcagcc 
ntgcttgctc 
cttgaggcca 
gccgacgacg 
tacgtgatcc 
ccgggcanca 
gggtatgttg 
ggcggccgcc 



acgctgtgct 
ctgtaaatat 
ttattctgta 
tacgtacgtg 
atccatgcat 
naagggggtg 
gatgtgtgca 
gactagtgag 



acacggggga 
aatgatngtc 
cgtaaatgaa 
tattatatac 
ggatgcttat 
ttgtattana 
aaaaaaaaaa 
ctcgtcgacc 



aacaatgcaa 
gctgctacat 
gccactacta 
gctctacccc 
gtatgtatat 
tttacgtctt 
aaaaanaaaa 
c 



60 
120 
180 
240 
300 
360 
420 
471 



<210> 20 
<211> 1085 
<212> DNA 
<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (1085) 
<223> n = A,T.C or G 



<400> 20 

cggctcgagg 

gaatccactt 

taaaggatca 

tgccccagtt 

ctatcccttc 

gctgcagaaa 

tratragcar 

gagtmcaggg 

agatgtaaaa 

cggtaaaaaa 

cctctattca 

cctgcaggaa 

tattatattt 

tgcaactggc 

agctgcttga 

cctcaaacaa 

ctttttaccc 

aaaaaaaaaa 

gtacc 



aaccctatta 
ttgggggaac 
gatttggtgt 
gtagcccaaa 
agttttgaag 
aatgttgaca 
atgcctcttc 
aatatcagac 
gctgtgatga 
ccgagcagaa 
gtcaagctac 
gtgracaggg 
ggggctgtcg 
ttctcacagt 
caaggtggct 
ggttgaatct 
tttttcattt 
ggcggccgcc 



aaattggaga 
aagctgcaga 
ttataacggc 
tatcaaaaga 
gacgtaagag 
cmmttatagt 
aaggatgctt 
attatamctg 
aagactctgg 
gaagcagccg 
tggggtagtg 
tttytmaggt 
ttgatgatcg 
cttttcagaa 
gagggccaag 
agaccatccc 
ttccaattat 
gactagtgag 



agttctgact 
ggaatcaaga 
tgggatgggt 
ggcaggttac 
atccttgcag 
gawtccmaat 
tccgytttgc 
tacctggact 
gactgcaatg 
aacaggctac 
tataatatta 
kgkgacyark 
ctacacgggg 
gaagttgcta 
aaagcaaggc 
cgcgaaagct 
tattattata 
ctcgtcgacc 



cgtggattag 
gatgctattg 

gggggaaccg 

ttgactgtag 
gcctttgaag 
gmccgtctgc 
agatgacgtt 
tkkcaaatgt 
cttggagtag 
tttggctcct 
ctggagggaa 
ttggctgatc 
gagattcacg 
acagatccaa 
agtccctcct 
ctttttttag 
ttatatnggc 
cgggaattaa 



gtacgggcgg 
ctgatgctct 
ggtctggtgc 
gtgttgttac 
ccatcgaaag 
ttgacawagy 
ytmsggcaag 
ggattttgca 
gtgtttccty 
ttaattggat 
aggacataac 
cttctgctaa 
tgactatcat 
gggcttgcaa 
cccctcaagt 
ttgcatggtt 
cgatcaaaaa 
ttccggaccg 



<210> 21 
<211> 797 
<212> DNA 
<213> Glycine sp 

<400> 21 

ccagctggcg aaaggggatg tgctgcaagg cgattaagtt gggtacgcag ggttttccca 
gtcacgacgt tgtaaaacga cggcagtgaa ttgaatttag gtgacactat agaagagcta 
tgacgtcgca tgcacgcgta cgtaagctcg gaattcggct cgagaggcta ctttggctcc 
tttaattgga tcctctattc agtcagctac tggggtagtg tataatatta ctggaggaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1085 



60 
120 
180 
240 
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ggacataacc 
ttctgctaat 
gactatcatt 
ggctgcaaag 
cctcaagtcc 
tgcatggttc 
gatcaaaaaa 
gtgagatatc 
ctcgtcgacc 
gtgagccgta 



ctgcaggaag 
attacatttg 
gcaactggct 
ctgcttgaca 
tcaaacaagg 
tcttiaccct 
aaaattatta 
atticticttga 
cgggaattaa 
ttagagg 



tgaacagggt 
gggctgtcgt 
tctcacagtc 
aggtggctga 
ctgaatctag 
ttttcatttt 
tattatattg 
tgttaaaaaa 
ttccggaccg 



ttctcaggtt 
tgatgatcgc 
ttttcagaag 
gggccaagaa 
accatccccg 
tccaattatt 
taggacacaa 
aaaaaaaaag 
gtacctgcag 



gtgactagtt 
tacactgggg 
aagttgccaa 
agcaaggrag 
cgaaagcrct 
attattacat 
tgatcttgat 
ggcggccgcc 
gcgtaccagc 



rggctgatcc 
agatrcacg^ 
cagatccaag 
tccctcctcc 
tttttttagn 
tatatcggcc 
gcttaactaa 
gactagtgag 
tttccccata 



300 
360 
420 
480 
540 
600 
660 
720 
780 
797 



<210> 22 
<211> 714 
<212> DNA 
<213> Glycine sp 

<220> 

<221> misc_f eature 
<222> (1) . . . (714) 
<223> n « A,T,C or G 



<400> 22 

aattcggctc 

tgataatggg 

ctcgaaccag 

atgtcaacaa 

tggttatgag 

gtcca taaag 

ccagcaactt 

gcttttttgc 

caatcccggg 

taccttcttt 

gcatcttcct 

tcttggttgg 



gagacggctg 
gcctctgaag 
ctcacagaaa 
cttcaactat 
ctgaacttga 
agaggcatca 
cactggcctt 
cgcagctgtg 
atggcatctc 
ctgcgaggaa 
ccactcttgg 
ttggaatgng 



cgagaagacg 
ttcttgttcc 
caatgatctc 
cctccattgt 
actccagcaa 
tctccttctt 
cttggatgcc 
ggaaccacct 
tgatgattca 
ccaagtcaaa 
tggtctaatt 
aattaatgna 



acagaagggg 
gttattataa 
ctacgccgac 
cagagatgta 
ccgcaacacC 
ctccgtggat 
ctttttcaac 
tggctatgct 
gaatctatga 
agttaganga 
cttgaaaggg 
ttctaccttt 



gttaccgtta 
aactgagccc 
atgctcaagg 
aactacagct 
tgttctctca 
gagagcaggt 
tccaagcgcc 
tacactttgc 
tatcaaacta 
tatgggcaag 
acagaaacat 
tgacattatg 



tcatgcaagc 
ttcactctct 
gatcacatgg 
gtggctcgtg 
ttgactcaaa 
tcactcagat 
aaagaaccaa 
ctctcaatct 
accgctttgt 
gtttgagact 
attcaccagt 
aagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



<210> 23 

<211> 525 

<212> DNA 

<213> Glycine sp 



<400> 23 

cgggctcgag attactggag gaaaggacat aaccctgcag gaagtgaaca gggtttctca 
ggttgtgact agtttggctg acccttctgc taatattata tttggggctg tcgttgatga 
tcgctacact ggggagattc acgtgactat cattgcaact ggcttctcac agtcttttca 
gaagaagttg ctaacagatc caagggctgc aaagctgctt gacaaggtgg ctgagggcca 
agaaagcaag g'tagtccctc ctcccctcaa gtcctcaaac aaggttgaat ctagaccatc 
cccgcgaaag ctcttttttt agttgcatgg ttctttttac cctttttcat ttttccaatt 
attattatta tattatattg gccgatcaaa aaaaaaatta ttatattata ttgtaggaca 
caacgatctt gatgcttaat taagtgagat atcattccct tgatgttctt tcccctccaa 
aaaaaaaaaa aaagggcggc cgccgactag tgagctcgtc gaccc 



60 
120 
180 
240 
300 
360 
420 
480 
525 



<210> 24 
<211> 1083 
<212> DNA 
<213> Glycine sp 



- 13 - 



WO 00/32799 
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<4C0> 24 

cggctcgagg 

gaatccactt 

taaaggatca 

tgccccagtt 

ctatcccttc 

gctgcagaaa 

tgatgagcag 

gagtacaggg 

atgcaaaagc 

gtaaaaaccg 

ctatttcagt 

scaggaagtg 

ttatatttgg 

caactggctt 

tgcttgacaa 

caaacaaggt 

tttacccttt 

aaaaaaaggg 

acc 



aaccctatta 
ttgggggaac 
gatttggtgt 
gtagcccaaa 
agttttgaag 
aatgttgaca 
atgcctcttc 
aatatcagac 
tgtgatgaaa 
agcagaagaa 
cagctactgg 
aacagggktt 
ggctgtcgtt 
ctcacagtct 
ggtggctgag 
tgaatctaga 
ttcatttttc 
cggccgccga 



aaattggaga 
aagctgcaga 
C tataacggc 
tatcaaaaga 
gacgtaagag 
cacttatagt 
aggatgcttt 
attacamctg 
gactctggga 
gcagccsaac 
gggtagtgta 
ctcaggttgt 
gatgatcgct 
t ttcagaaga 
ggccaagaaa 
ccatccccgc 
caattattat 
ctagtgagct 



agttctgact 
ggaatcaaga 
tgggatgggt 
ggcaggttac 
atccttgcag 
gattccaaat 
tccgtcttgc 
wcctggactt 
ctgcaatgct 
aggctacttt 
taatattact 
gactaagttt 
acacggggga 
agtcgctaac 
gcaaggcagt 
gaaagctctt 
tattatatta 
cgtcgacccg 



cgtggattag 
gatgctattg 
gggggaaccg 
ctgactgtag 
gcctttgaag 
gaccgtctgc 
agatgacgtt 
gtcaatgtgg 
tggagtaggt 
ggctycttta 
ggaggaaagg 
ggctgatcct 
gattcacgtg 
agatccaagg 
ccctcctccc 
tttttagttg 
tattggccga 
ggaattaatt 



gtacgggcgg 
ctgatgctct 
ggtctggtgc 
gtgttgttac 
ccatcgaaag 
ctgacatagc 
ctacggcaag 
atttttgcag 
gtttcctccg 
attggatcct 
acataaccct 
tctgctaata 
actatcattg 
gctgcaaagc 
ctcaagtcct 
catggttctt 
tcaaaaaaaa 
ccggaccggt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1083 



<210> 25 

<211> 1335 

<212> DNA 

<213> Glycine sp 

<220> 

<221> niisc_f eature 
<222> (1) . . . {1335} 
<223> n = A,T,C or G 



<400> 25 

cggctcgagg 

ggtgagatgc 

tggcggcaac 

tgcgataaat 

tggagaagtt 

tgcggaggaa 

aacggctggg 

caaaagaggc 

cgtaagagat 

ttatagttat 

aggatgcttt 

attwtaacag 

gacttctggg 

gcagcagaac 

ggtagtgtat 

tcaggtggtg 

atgatcgctt 

tttcagaaga 

ggccaagaaa 

ccatccccgc 

attattatat 

tcattctctt 



cccagaacaa 
tcctacgctt 
aatgccgtta 
accgatgctc 
ctgactcgtg 
tccagagatg 
atgggtgggg 
aggntacttt 
gcttgcaggc 
tccaaatgat 
ycgytytkca 
gtacctggac 
actgcaatgc 
agggctactt 
aatattactg 
actagtttgg 
acactgggga 
agttgctaac 
gcaaggcagt 
gaaagctctt 
tgtaagtcat 
gatgttaaaa 



caaaaattgc 
acgtagataa 
atcgcatgat 
aggcactatt 
gattaggtac 
ctattgctga 
gaaccgggtc 
gactgtaggt 
ctttgaagcc 
cgtctgcttg 
rawkatgtty 
ttgtmaatgt 
ttggtgtagg 
tggctccttt 
gaggaaagga 
ctgatccttc 
gattcacgtg 
agatccaagg 
ccctcctccc 
tttgtagttg 
tctgtagtac 
aaaaaaaaaa 



tcctcaacgc 
cgccaaaatt 
cggaagtggt 
aaattctgct 
aggtgggaat 
tgctcttaaa 
ttggtgctgc 
gttggtacct 
atcgaaaggc 
acatancttg 
tamcgsaarg 
agattrtgct 
tgtttcatcc 
aattggatca 
cataaccctg 
tgctaatatt 
actataattg 
gctgcaaagc 
cccaagtcct 
catggttctt 
aatgatcttg 
aaaaaaaaaa 



ctaagtcgtc 
aaggttgtcg 
ttgcagggtg 
gctgagaacc 
ccacttttgg 
ggatcagatt 
cccagttgta 
atcccttcag 
tgcagaaaaa 
atgaaccaga 
sgkacaggga 
gatgtaaaam 
ggtaaaaccg 
tctattcagt 
caggaagtga 
atatttggag 
caactggctt 
tgcttgacaa 
caatcaaggt 
ttaccctttt 
atgcttaatt 
aaagggcggc 



gtttcggttc 
gcatcggcgg 
tagacttcta 
ctattaaaat 
gggaacaagc 
tggtatttat 
gnccaaatat 
ttttgaagga 
tgttgcacac 
tgcctattca 
atatcaagac 
ctgkgataaa 
accagaagaa 
cagctactgg 
acagggtttc 
cttgttgttg 
ctcacagtct 
agtggctgag 
tgaatctaga 
cttttttcca 
tagtgagata 
cgccgactag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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1335 

tgagctcgtc gaccc 

<210> 26 
<211> 902 
<212> DNA 
<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (902) 
<223> n = A ( T,C or G 

<400> 26 . - 

aattcggctc gagtaccagg gttggtgaat gtagattttg ctgatgttcg ggctataatg 

gccaatgcag gttcttcact aatggggata ggaactgcaa ctggaaaatc aagggcaaga 

gatgctgcat taaatgccat ccagtcacct ttactggata ttggtatara gagggctact 

Jcgaattgttt ggaacawaac tggtgggact gatctgrcct tgtttgaggt aaacacggca 

gcagaggtta tttatgacct cgtggaccct actgctaatt taatatttgg agcagtaata 

gatccatcac tcagtggtca agtgagcata acattaattg cttactgrat tcaaagcgyc 

aagaggagag tgaagggagg cctctgcagg ccagtcaact cactcaagca gacacaacct 420 

tcggcaccaa ttggcggtct tcctctttca ctgatggtgg tttgtttgag ataccagaat 480 

tcctaaagaa garaggaggt tcacgctatc cgagggcgta atctttttca tcctaatttc 

ttttgatccc ttgcatttct tcacccttgg atatacatag caattggtct agttcttarg 

tccctgtctt gscctttttc ggatttwrkc aaragttgkg. katacagttk gttcatgaaa 

gtttattact tyccactgkc cagacttatg ggkctaaacc gganggtatt ksarcatgga /zu 

tgcttttctt ggcatatttg aattagttta ttagcttgta cagagatttc agtaatgctg 

agagcttgtt atagttcttt ggcatgttat agaaaattca ttattattaa aaaaaaaaaa 

aaaggcggcc gccgactagt gagctcgtcg acccgggaat tnattccgga ccggtacctg 



<210> 27 
<211> 856 
<212> DNA 
<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (856) 
<223> n = A,T,C or G 



60 
120 
180 
240 
300 
360 



540 
600 
660 



840 
900 
902 



60 
120 
180 
240 
300 



<400> 27 

aattcggctc gagattggtg aaccgtagac tttgctgatg ttcgagctat aatggccaat 
gcagggtctt cacttatggg gataggaact gcaactggga aaacaagggc marggawgct 
gcattaaatg ctatccagtc mccctttact ggatatttgg tataraaagg gctactggaa 
ttgtatggaa cataacyggk ggaagtgatt tgaccttgtt tgaaggtaaa tgttgcasca 
raagttatat atgmccttgt ggmccccact gstaatttaa tatttgggsc agwaatagat 
ccatcactcc agtgggcaag taagcatamm wtaatcgcaa ctggattcaa gcgtcaagag 360 
gaaaagtgaa gggagaccct atgcaggcca gtcaactcac acaaggagat nccgttggta 420 
tcaatcggcg atyttctact ttcactgatg gtagcttttg Ltggagatcc ctggaattct 480 
taaagaagaa ggggcgctca cgttatccaa gagtttaata ctcttttccc caactcctta 540 
atccctcctt gcatctcttt mccaascaat ttttagggat acaaatctca tcagtctaag 600 
gtattagatc acggtttttg cccctttttt catttttagg ttcgcattgt gcantamagt 660 
tgttcattga aagcgaagtt actttccaaa accgttgttt tctgarttga aggcttggtt 720 
ggcatgtttt wataagttta ttagcttgta tttttgtnca gagaataata tatcagtaat /au 
ggtcagtgct tgttataaan ccncnaaaaa aaaaaaaaaa aaaagggcgg ccgccgacta 



780 
840 
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gtgagctcgt cgaccc 

<210> 28 

<211> 1060 

<212> DNA 

<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (1060) 
<223> n = A,T.C or G 



856 



<400> 28 

aattcggctc 

acaagaagga 

caaactgctg 

tgatattctt 

gaatgtagac 

gataggaact 

acctttacta 

aaagwgattt 

accccactgy 

gcatcacatt 

acaggccarc 

cactgatggk 

tccgagagct 

tttttaggga 

ttcattttta 

cgttgtttac 

ttgtacagag 

aaaaaaaaag 



gaggtcacaa 
attgctigcat 
actgcagttt 
agacaaggtg 
tttgcagatg 
gcaactggca 
gatattggta 
gaccttgttt 
taatttaata 
aattgcaact 
caatttacac 
agctttgttg 
taatactctt 
tacaaatctc 
ggttcgcatc 
tgagtcgagg 
aatatatcag 
gcggccgccg 



cccctttttc 
taagagataa 
ctcaatctac 
ttcgtggtat 
ttcgagctat 
aaacaagggc 
tagaaagggc 
gaggtaaatg 
tttggagcag 
ggattcaagc 
aaggagatac 
agayccctga 
ctccccaatt 
atcagtctag 
gggcatacxg 
cttgttggca 
taatggtcag 
actagtgagc 



atttgaaggg 
tgttgataca 
ccctgtaact 
atctgatatt 
aatggccaat 
aagagacgct 
taccggaatt 
ctgcagcaga 
taatagatcc 
cgtcaagagg 
ggttggtatc 
attcttaaag 
tcttaatccc 
gtattagatc 
ttgttcaaga 
ggttttaata 
ggcttgttat 
tcgtcgaccc 



cgaagaaggg 
ctgatagtta 
gaagcattca 
attacgatac 
gcaggttctt 
gcattaaatg 
gtatggaaca 
agttatatat 
atcactcagt 
aaaagtgaag 
aaccsgcgat 
aaraaggggc 
ttgatttctt 
ccgttttgcc 
aagcaaagta 
agtttattag 
nnnanncccn 



cagttcaagc 
ttccaaatga 
atctggctga 
caggattggt 
cactcatggg 
ctatccagtc 
taactggtgg 
gaccttgtgg 
ggtcaagtaa 
ggagacctat 
cttcctcttt 
gctcaygtta 
tacaaagtaa 
cctttttttt 
ctttcaaaac 
cttgtatttt 
nnnannaaan 



<210> 29 

<211> 727 

<212> DNA 

<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (727) 
<223> n = A,T,C or 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 



<400> 29 

atctcncaaa 

gggttttant 

attctcagtg 

caggaaagcn 

agtcggtggc 

ccactgctca 

agccttacgc 

catgcaatgc 

aattgctgta 

ggtgcttata 

ctttagggtt 

ggagacagga 



atgcatgncn 
ttgctcttag 
gattgttcac 
ccatggcagg 
aaatgctgaa 
ttaggtgact 
gtagaatttt 
aatcacaata 
cctaacagtg 
ggaaatggaa 
tttgggatta 
ctattactat 



ctgtgtgtgg 
aaaatgtgtt 
caaggtattc 
ggntgaggga 
tacgcgtatc 
tcggcctaag 
cccttaaaaa 
cagagtccta 
agtaaatcta 
atgctagtga 
aagaaaacga 
aaaaaaaaaa 



ca-atattca 
gagcctgcac 
cctcactagg 
ncggtgtana 
acaactccat 
ttgacttgta 
aaaaaatata 
gctagggacc 
gtgaagagaa 
atccttaaat 
aatgtcataa 
aaaaagggcg 



aaatgacttg 
atanaagatt 
gaatcagggt 
aaggagtggc 
tggaattgat 
aacatattgt 
ttcctatgta 
aaacatcatt 
ttattattgc 
tggaggctga 
ttatcatacc 
gccgccgact 



gcccagggtg 
ggagttgttg 
gantctcaaa 
catgttccag 
acatctaatt 
tactaccctt 
acgttacgta 
tcgatgtaga 
tgctaacgaa 
caacgaagtt 
cttgggatga 
agtgagctcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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tcgaccc 

<210> 30 

<211> 1185 

<212> DNA 

<213> Glycine sp 

<22Q> 

<221> misc_f eature 
<222> (1) . . . (1185) 
<223> n = A,T,C or G 



727 



<400> 30 

cggctcgagc 

ttgcaaagtc 

ggagaaagag 

cgcttatagt 

ctgaagcatt 

ttattacaat 

acgcaggttc 

ctgcattaaa 

ttgtttggaa 

rraggttatt 

tccatcactc 

ggarartgaa 

caccaattgg 

aaagaagaga 

atcccttgca 

tcttgccctt 

tacttcccac 

ttggcatatt 

ttatagttct 

aaaaaaaaaa 



tggaatgggt 
aatgggtata 
atctattcaa 
tattccaaat 
caatctggct 
accagggttg 
ttcactaatg 
tgccatccag 
cawaactggk 
tatgacctcg 
agtggkcaag 
rggaggcctn 
cggtcttcct 
ggaggttcac 
tttcttcacc 
tttcggattt 
tgtccagact 
tgaattagtt 
ttggcatgtt 
aaagggcggc 



gggggaactg 
ttgacggttg 
gcccaagaag 
gacaagctac 
gatgatattc 
gtgaatgtag 
gggataggaa 
tcaccwttmc 
gggactgacc 
tggaccctac 
tgagcataac 
Lgcaggccag 
ctttcactga 
gctatccgag 
cttggatata 
tagtcagagt 
tatgggtcta 
tattagcttg 
atagaaaatt 
cgccgactag 



gcacaggtgg 
gtattgtcac 
gaa^tacagc 
taacggcagt 
ctcgacaggg 
attttgctga 
ctgcaactgg 
tggatattgg 
ttgaccctgt 
tgctaattta 
attaattgct 
tcaactcact 
tggtggtttg 
ggcgtaatct 
catagcattg 
tgtgtataca 
accggaggta 
tacagagatt 
cattattatt 
tgagctcgtc 



agctccaatt 
cacccctttc 
cttaagagat 
ttctcaatct 
tgttcgtggc 
tgttcgggct 
aaaatcaagg 
ta^agararg 
ttgaggtiaaa 
atatttggag 
actggattca 
caagcagaca 
tttgagatac 
ttttcatcct 
gtctagttct 
gtttgttcat 
ttgcagcatg 
tcagtaatgc 
attcatcccn 
gaccc 



attgctagta 
tcgtttgaag 
aatgttgaca 
acccctgtaa 
atatctgata 
ataatggcca 
gcaagagatg 
gctactggaa 
cacggcarca 
cagtaataga 
agcgtcaaga 
caaccttcgg 
cagaattcct 
aatttctttg 
taggtccctg 
gaaagtttat 
gatgcttttc 
tgagagcttg 
ccaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1185 



<210> 31 

<211> 700 

<212> DNA 

<213> Glycine sp 

<220> 

<221> misc_feature 
<222> (1) . . . (700) 
<223> n = A,T,C or G 



<400> 31 

aattcggctc 

agcccaagaa 

tgacaagcta 

tgatgatatt 

tggtgaatgt 

tggggatagg 

agtcaccttt 

gtgggactga 

ggccctactg 

cattnantgc 



gagattgtca 
ggaattacag 
ctaacggcag 
cttcgacagg 
agattttgct 
aactgcaact 
actggatatt 
tctgccttgt 
ctaattaata 
tctggattca 



ccaccccttt 
ccttaagaga 
tttctcaatc 
gtggtccgtg 
gatgttcggg 
ggaaaatcaa 
ggtatagaga 
ttgaggtaaa 
tttggagcag 
agcgtcaaga 



ctcgtttgaa 
taatgttgac 
tacccctgta 
gcatatctga 
ctataatggc 
gggcaagaga 
gggctactgg 
cacngcagca 
aatagatcca 
ngagaagtga 



gggagaaaga 
acgcttatag 
actgaagcat 
tattattaca 
caatgcaggt 
tgctgcatta 
aattgtttgg 
ganggtattt 
tcctcatggc 
agggangcct 



gatctattca 
ttattccaaa 
tcaatctggc 
ataccagggt 
tcttcactaa 
aatgccatcc 
aacataactg 
atgacctcgn 
aagtgacata 
ttgcaggcca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gcactcactc agcagacaca accttngnac caattggcgg cttcctcttt cactgatggg 
nggttggttg agatncnana attcctaaag aaaaanagag 



660 
700 



<210> 32 
<211> 1425 
<212> DNA 

<213> Arabidopsis sp 

<400> 32 . 
ccacgcgtcc ggaggaagta aacaatggcg ataattccgt tagcacagct taatgagcta 
acgatttctt catcttcttc ttcgtttctt accaaatcga tatcttctca ttcgttgcat 
agtagctgca tttgcgcaag ttctagaatc agtcaattcc gtggcggctt ctctaaacga 
agaagcgatt caacaaggtc taagtcgatg cgattgaggt gttccttctc tccgatggaa 240 
tctgcgagaa ttaaggtgat tggtgtcggt ggtggtggta acaatgccgt taaccggatg 3 00 

atttcaagcg gtttacagag tgttgatttc tatgcgataa acacggattc gcaagctctg 
ttacagtttt ctgctgagaa cccacttcaa attggagaac ttttaactcg tgggcttggc 
actggtggaa acccgcttct tggagaacaa gctgcagaag aatcaaaaga tgcaattgct 
aatgctctta aaggatcaga ccttgttttc ataactgctg gtatgggtgg tggaacaggg 540 
tctggtgctg cacctgtggt agctcagatt tcgaaggatg ctggttattt gactgttggt 600 
gttgttacct atccgtttag ctttgaagga cgtaaaagat ctttgcaggc actggaagct 660 
attgaaaagc tccaaaagaa tgttgatacc cttatcgtga ttccaaatga tcgtctgcta 72 0 

gatattgctg atgaacagac gccacttcag gacgcgtttc ttcttgcaga tgatgtttta 
cgccaaggag tacaaggaat ctcagatatt attactatac ctggactagt caatgtggat 
tttgcagatg tgaaggcagt catgaaagat tctggaactg caatgctcgg ggtaggtgtt 
tcttccagca aaaaccgggc agaagaagca gctgaacaag caactttggc tccattgatc 
ggatcatcca tacaatcagc tactggtgtc gtctacaaca tcactggtgg aaaagacata 
actttgcagg aagtgaaccg agtatcacag gtcgtgacaa gtttggcaga cccatcggcc 
aacatcatat ttggagctgt tgtggatgat cgctacaccg gagagattca tgtaacgata 1140 
atcgccacag gcttctctca gtcattccag aagacacttc tgactgatcc aagagcagct 1200 
aaactccttg acaaaatggg atcatcaggt caacaagaga acaaaggaat gtctctgcct 1260 
caccagaagc agtctccatc aactatctct accaaatcgt cttctccccg tagacttttc 1320 
ttctagtttt ctttttttcc ttttcggttt caagcatcaa aaatgtaacg atcttcaggc 1380 
tcaaatatca attacatttg attttcctcc aaaaaaaaaa aaaaa J-^a 

<210> 33 
<211> 1611 
<212> DNA 

<213> Arabidopsis sp 

<400> 33 />r\ 
tgttgttgcc gctcagaaat ctgaatcttc tccaatcaga aactctccac ggcattacca 60 
aagccaagct caagatcctt tcttgaacct tcacccggaa atatctatgc ttagaggtga 120 
agggactagt acaatagtca atccaagaaa ggaaacgtct tctggacctg ttgtcgagga 180 
ttttgaagag ccatctgctc cgagtaacta caatgaggcg aggattaagg ttattggtgt 240 
gggaggtggt ggatcaaatg ctgtgaatcg tatgatagag agtgaaatgt caggtgtgga 3 00 

g??c?gga?t |?caacactg atatccaggc tatgagaatg tctcctgttt tgcctgataa 360 
taggttacaa attggtaagg agttgactag gggtttaggt gctggaggaa atccagaaat 420 
cggtatgaat gctgctagag agagcaaaga agttattgaa gaagctcttt atggctcaga 480 
tatggtcttt gtcacagctg gaatgggcgg tggaactggc actggtgcag cccctgtaat 540 
tgcaggaatt gccaaggcga tgggtatatt gacagttggt attgccacaa cgcctttctc 
gtttgagggt cgaagaagaa ctgttcaggc tcaagaaggg cttgcatctc tcagagacaa 
tgttgacact ctcatcgtca ttccaaatga caagttgctt acagctgtct ctcagtctac 
tccggtaaca gaagcattta atctagctga tgatatactc cgtcaggggg ttcgtgggat 
atctgatatc attacgattc ctggtttggt gaatgtggat tttgctgatg tgagagctat 
aatggcaaat gcggggtctt cattgatggg aataggaact gcgacaggaa agagtcgggc 



60 
120 
180 



420 
480 



780 
840 
900 
960 
1020 
1080 



660 
720 
780 
840 
900 
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960 



aagagatgct gcgctaaatg caatccaatc ccctttgtta gatattggga ^agagagc 
cactggaa.t gtttggaaca ttactggcgg aagtgacttg »«ttgc«g «gtaaatgc 1020 
tgctgcggaa gtaatatatg atcttgtcga tccaactgcc aatcttatat ^cgtgctgtt x 
g'agatccag ccctcagcgg tcaagtaagc ataaccctga tagctacggg tttcaaacga 
caagaagagg gagaaggacg aacagttcag atggtacaag cagatgctgc £cagttgga 
gctacaagaa gaccctcttc ttcctttaga gaaagcggtt cagtggagat cccagagttc 
ttgaagaaga aaggcagctc tcgttatccc cgagtctaaa gcccaatcta atcactaccc 
tgcacactgc agSaataaca aacgtgtgtg tactggtagt ctggtactgc cttctgggat 
acagcaagat gtgttgatgt atgatcaaga atctgtgtgg ^gtgtatat gtctgtcac 
tgcctctggt cgtgttcttg aataggttgt tttagaaatc ggagtttctc tctatgtcac 
ttccaaaaca aaaaaggaga agaagaatca cacttctcga accataaaca tacttataag 
attatgagag ttttagcaga aattattgtc aaaaaaaaaa aaaaaaaaaa a 



<210> 34 
<211> 299 
<212> DNA 

<213> Arabidopsis sp 
<220> 

<221> misc_feature 
<222> (IS . . . (299) 
<223> n = A, T, C or G 



agtaattgaa aaatgacaca tacctctttt nctacaatca aagtttataa ctaagaagaa 
?aaatgcaat agtaataata aagatttgag gacatactgt gacaaagacc ^anctgagc 
cataaagagc ttctneaat. actgctttgc tctctctagc agcattcata ccgatttcng 
gatttcctcc agcacctaaa cccctagtca actccttacc aatttgtaac ctattatcag 
gcaaaacagg agacattctc atagcctgga tatcagtgtt gacaatccag aactccaca 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1611 



<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 35 25 
cttgccgcaa aacatcatcc gcgag 

<210> 36 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 36 28 
cacccacccc gagcatcgca gttccgga 



<210> 37 
<211> 41 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 37 

ttggtgtcgg tggtggtggt aacaatgccg ttaaccggat g 

<210> 38 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 38 

ggtaacaatg ccgttaaccg gatgatttca agcggtttac a 

<210> 39 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 39 

gctgctcttg gatcagtcag aagtgtcttc tggaatgact g 

<210> 40 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 40 

tttgtcaagg agtttagctc ctcttggatc agtcagaag 

<210> 41 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 41 

cagcaccaga tcctgtacct ccacccat 

<210> 42 
<211> 30 
<212> DNA 



PCT/US99/28103 



41 



41 



41 



39 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 42 

ctgaatgggt atgtgacaac accaacagtc 

<210> 43 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 43 

gctgatgatg tattacgcca aggtgtcc 

<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Synthetic Oligonucleotide 
<400> 44 

ctggaactgc tatgcttgga gttggggt 

<210> 45 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 45 

tcgaaactga atggataggt gacaacacca acag 

<210> 46 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 46 

tggatcgaca aggtcgtaga taatttcggc cgcagc 

<210> 47 
<211> 56 
<212> DNA 
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<213> Artiticial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 47 

cgcgatttaa atggcgcgcc ctgcaggcgg 

<210> 48 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



ccgcctgcag ggcgcgccat ttaaat 



<220> 

<223> Synthetic Oligonucleotide 
<400> 48 

tcgaggatcc gcggccgcaa gcttcctgca gg 

<210> 49 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<40C> 49 

tcgacctgca ggaagcttgc ggccgcggat cc 

<210> 50 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 50 

tcgacctgca ggaagcttgc ggccgcggat cc 

<210> 51 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 51 

tcgaggatcc gcggccgcaa gcttcctgca gg 



<210> 52 
<211> 36 
<212> DNA 
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<2i3> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 52 

tcgaggatcc gcggccgcaa gcttcctgca ggagct 

<210> 53 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 53 

cctgcaggaa gcttgcggcc gcggatcc 

<210> 54 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<40C> 54 

tcgacctgca ggaagcttgc ggccgcggat ccagct 

<210> 55 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 55 

ggatccgcgg ccgcaagctt cctgcagg 

<210> 56 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> c-rayc Tag 
<400> 56 

Glu Gin Lys Leu He Ser Glu Glu Asp Leu 
1 5 10 

<210> 57 
<211> 37 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 57 37 
acgtgcggcc gcatggcgat aattccgtta gcacagc 

<210> 5B 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 58 37 
actgataagc ttttgctcga agaaaagtct acgggga 

<210> 59 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 59 37 
acgtgcggcc gcatggcgat aattccgtta gcacagc 

<210> 60 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 60 44 
cgtcctgcag gctacaagtc ttcctcactg ataagctttt gctc 

<210> 61 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 61 37 
acgtcctgca ggatggcgat aattccgtta gcacagc 

<210> 62 
<211> 38 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 62 

acgtgcggcc gcctagaaga aaagtctacg gggagaag 

<210> 63 
<211> 32 
<212> DNA 

<213> Artificial Sec/uence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 63 

acgtaagctt tccttctctc cgatggagtc tg 

<210> 64 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 64 

acgtctgcag cttttcaatg gcttcaagtg cct 

<210> 65 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 65 

acgtctgcag aagaacgtgg ataccctcat cgtg 

<210> 66 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 66 

acgtgcggcc gcctagaaga acaatctacg gggagaag 

<210> 67 
<211> 244 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Prrn/NEP/GenelO leader/ 14 amino acid from GFP 
fusion 



<400> 67 

gaattcggta cccccgtcgt tcaatgagaa tggataagag gctcgtggca ttgacgtgag 
ggggcaggga tggctatatt tctgggagcg aactccgggc gaatactgaa gcgcttggat 
acaagttatc cttggaagga aagacaattc cggatcctct agaaataatt ttgtttaact 
ttaagaagga gatataccca tgggtaaagg agaagaactt ttcactggag ttgtcccaag 
catg 



60 
120 
180 
240 
244 



<210> 68 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 68 

gtcctgcagg atggccacca tctcaaaccc ag 

<210> 69 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 69 

tagcggccgc ctatataaag gagctaaaag aacagcc 

<210> 70 

<211> 669 

<212> DNA 

<213> Arabidopsis sp 



32 



37 



<400> 70 

atggcgatta 

acatcccact 

gcaaacacgg 

tgtcggcggt 

tgatttctat 

tcttcaaatt 

agaacaagct 

tgytttcatt 

tcagatctcc 

cgaaggtcgt 

ggataccctc 

tcttcaaga 



gtccgttggc 
cgctgcacag 
ttgagatgct 
ggtggtaaca 
gcgataaaca 
ggagagctcc 
gctgaggaat 
actgctggta 
aaagacgctg 
aaaagatctt 
atcgtgatac 



acagcttaac 
taccagaatc 
ccttctctcc 
atgccgtcaa 
cggactctca 
taactcgtgg 
ctaaagacgc 
tgggtggtgg 
gttatttgac 
tgcaggcact 
caaatgatcg 



gagctaccag 
agtggcgggc 
gatggagtct 
tcgcatgatt 
agctctcttg 
ccttgggact 
gattgctaat 
cactggctcc 
cgttggtgtt 
tgaagccatt 
tctcctagat 



tctcttcctc 
ttctcaaaac 
gcgaggatta 
tccagcggct 
cagtcttctg 
ggtgggaacc 
gctcttaaag 
ggtgctgctc 
gttacctatc 
gaaaagctgc 
attgctgatg 



gtttsttgcg 
aaaggtttaa 
aggtggttgg 
tacagagtgt 
cgcagaaccc 
cgcttctagg 
gatctgacct 
ctgttgttgc 
ccttcagctt 
agaakaacgt 
aacagacgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
669 
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<210> 71 

<211> 646 

<212> DNA 

<213> Nicotiana sp 



<400> 71 

ggccctctag 

ttaagcagtg 

aacagcaatg 

tttttaccac 

cttatgtaaa 

gattaaggtt 

tggcttacag 

tgctgctgaa 

caatcctctt 

aaaaggttca 

tgaagggcga 



atgcatgc tc 
gtaacaacgc 
gccaccatct 
tcttccttta 
cctcaacgtt 
atcggcgtcg 
ggtgttgact 
aatccacttc 
ttaggggaac 
gatatggtgt 
attccagcac 



gagcggccgc 
agagtacgcg 
caaacccagc 
ttcctaaaca 
tcagcatttc 
gtggcggtgg 
tctatgctat 
aaattggaga 
aggcagcaga 
tcataacagc 
accggcggcc 



cagtgtgatg 
ggggtaaacc 
agagatagca 
atgctgcttc 
aagttcattt 
caacaatgcc 
aaacacggat 
gcttctgact 
ggagtcgaag 
aggaatgggt 
gttactagtg 



gatatctgca 
aaacagacag 
gctt:cttctc 
accaaagctc 
actccttttg 
gttaaccgta 
gctcaagcac 
cgtgggcttg 
gaagccattg 
ggaggtacag 
gatccg 



<210> 72 

<211> 325 

<212> PRT 

<213> Glycine sp 



gaattcgccc 
agagagcaga 
cttcctttgc 
gccggaaaag 
attctgctaa 
tgattggcag 
tgctgcagtc 
gtactggtgg 
caaattctct 
gatctggtgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
646 



<400> 72 












Gly 


Ser 


Arg 


Pro 


Arg 

5 


Thr 


Thr 


Lys 


1 

Arg 


Phe 


Gly 


Ser 


Val 


Arg 


Cys 


Ser 








20 










lie 


Lys 


Val 


Val 


Gly 


lie 


Gly 


Gly 




35 










40 


Met 


lie 


Gly 


Ser 


Gly 


Leu 


Gin 


Gly 




50 










55 




Asp 


Ala 


Gin 


Ala 


Leu 


Leu 


Asn 


Ser 


65 










70 






Gly Glu Val 


Leu 


Thr 


Arg 


Gly 


Leu 










85 








Gly Glu Gin 


Ala 


Ala 


Glu 


Glu 


Ser 








100 










Lys 


Gly 


Ser 


Asp 


Leu 


Val 


Phe 


He 




115 










120 


Gly 


Ser Gly 


Ala 


Ala 


Pro 


Val 


Val 


130 










135 




Tyr 


Leu 


Thr 


Val 


Gly 


Val 


Val 


Thr 


145 










150 






Lys 


Arg 


Ser 


Leu 


Gin 


Ala 


Phe 


Glu 








165 








Val 


Asp 


Thr 


Leu 


He 


Val 


He 


Pro 








180 










Asp 


Glu 


Gin 


Met 


Pro 


Leu 


Gin 


Asp 




195 










200 


Leu 


Arg 


Gin 


Gly 


Val 


Gin 


Gly 


He 




210 










215 




Leu 


Val 


Asn 


Val 


Asp 


Phe 


Ala 


Asp 


225 










230 






Gly 


Thr 


Ala 


Met 


Leu 


Gly 


Val 


Gly 



He 


Ala 


Pro 


Gin 


Arg 


Leu 


Ser 


Arg 


10 










15 




Tyr 


Ala 


Tyr 


Val 


Asp 


Asn 


Ala 


Lys 


25 










30 






Gly 


Gly 


Asn 


Asn 


Ala 


Val 


Asn 


Arg 






45 








Val 


Asp 


Phe 


Tyr 


Ala 


He 


Asn 


Thr 






60 










Ala 


Ala 


Glu 


Asn 


Pro 


He 


Lys 


He 




75 










80 


Gly 


Thr 


Gly 


Gly 


Asn 


Pro 


Leu 


Leu 


90 










95 




Arg 


Asp 


Ala 


He 


Ala 


Asp 


Ala 


Leu 


105 








110 






Thr 


Ala 


Gly 


Met 


Gly Gly Gly 


Thr 










125 








Ala 


Gin 


He 


Ser 
140 


Lys 


Glu 


Ala 


Gly 


Tyr 


Pro 


Phe 


Ser 


Phe 


Glu 


Gly 


Arg 




155 










160 


Ala 


He 


Glu 


Arg 


Leu 


Gin 


Lys 


Asn 




170 










175 


Ala 


Asn 


Asp 


Arg 


Leu 


Leu 


Asp 


He 


185 








190 




Val 


Ala 


Phe 


Pro 


Phe 


Ala 

205 


Asp 


Asp 


Ser 


Asp 


He 


He 


Thr 


Val 


Pro Gly 






220 










Val 


Lys 


Ala 


Val 


Met 


Lys Asp 


Ser 




235 










240 


Val 


Ser 


Ser 


Gly 


Lys 


Asn 


Arg 


Ala 
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245 








Glu 


Glu 


Ala 


Ala 


Glu 


Gin 


Ala 


Thr 








260 










He 


Gin 


Ser 


Ala 


Thr 


Gly 


Val 


Val 




275 










280 


He 


Thr 


Leu 


Gin 


Glu 


Val 


Asn 


Arg 




290 










295 




Ala 


Asp 


Pro 


Ser 


Ala 


Asn 


He 


lie 


305 










310 






Tyr Thr Gly Glu 


He 
















325 









250 255 
Leu Ala Pro Leu He Gly Ser Ser 
265 270 
Tyr Asn He Thr Gly Gly Lys Asp 
2 85 

Val Ser Gin Val Val Thr Ser Leu 
300 

Phe Gly Ala Val Val Asp Asp Arg 
315 320 



<210> 73 
<211> 357 
<212> PRT 
<213> Zea mays 



<220> 

<221> VARIANT 

<222> (1) . . . (357) 

<223> Xaa = Any Amino Acid 



Asp°Leu 3 Val Phe He Thr Ala Gly Met Gly Gly Gly Thr Gly Ser Gly 

1 5 1° 15 

Ala Ala Pro Val Val Ala Gin He Ser Lys Glu Ala Gly Tyr Leu Thr 

20 25 JO 

Val Gly Val Val Thr Tyr Pro Phe Ser Phe Glu Gly Arg Lys Arg Ser 

35 40 45 

Val Gin Ala Leu Glu Ala Leu Glu Lys Leu Glu Lys Ser Val Asp Thr 

50 55 60 

Leu He Val He Pro Asn Asp Lys Leu Leu Asp Val Ala Asp Glu Asn 
65 70 75 8 

Met Pro Leu Gin Asp Ala Phe Leu Leu Ala Asp Asp Val Leu Arg Gin 

85 90 95 

Gly Val Gin Gly He Ser Asp He He Thr He Pro Gly Leu Val Asn 

100 105 - 10 

Val Asp Phe Ala Asp Val Lys Ala Val Met Lys Asn Ser Gly Thr Ala 

115 120 3-25 

Met Leu Gly Val Gly Val Ser Ser Ser Lys Asn Arg Ala Gin Glu Ala 

130 135 140 

Ala Glu Gin Ala Thr Leu Ala Pro Leu He Gly Ser Ser Tie Glu Ala 
145 150 I 55 ieU 

Ala Thr Gly Val Val Tyr Asn He Thr Gly Gly Lys Asp He Thr Leu 

165 170 175 

Gin Glu Val Asn Lys Val Ser Gin He Val Thr Ser Leu Ala Asp Pro 

180 185 190 

Ser Ala Asn He He Phe Gly Ala Val Val Asp Asp Arg Tyr Thr Gly 

195 200 205 

Glu He His Val Thr He He Ala Thr Gly Phe Pro Gin Ser Phe Gin 

210 215 220 

Lys Ser Leu Leu Ala Asp Pro Lys Gly Ala Arg He Val Glu Ser Lys 
225 230 235 

Glu Lys Ala Ala Thr Leu Ala His Lys Ala Ala Ala Ala Ala Val Gin 
245 250 255 
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Pro Val Pro Ala Ser Ala 
260 

His Leu Val Asn Arg Asp 
275 

Leu Arg Ala Val Pro Lys 
290 

Leu Gly Xaa Cys Ser Val 
305 310 
Leu Leu Cys Phe Met Phe 
325 

Cys Ser Ala Thr Gin His 
340 

He Ala Ala Thr Cys 
355 



Trp Ser Arg Arg Leu Phe 
265 

Ser Xaa Cys He Arg Phe 
280 

Val He Phe Gly Tyr Leu 
295 300 
Val Val Glu Xaa Val Ser 
315 

Leu Arg He Xaa Arg Xaa 
330 

Xaa Thr Val Xaa Lys He 
345 
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Ser 


Xaa 


Glu 


Ala 




270 






Ala 


Phe 


Ser 


Val 


285 








Glu 


He 


Tyr 


Ser 


Val 


Tyr 


Val 


Ser 








320 


Gly Xaa 


Glu 


Lys 






335 




Phe 


Asp 


Cys 


Phe 




350 
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